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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gatanaga, T. 

Granger, G.A. 

<ii> TITLE OF INVENTION: Factors Altering Tumor Necrosis 
Factor Receptor Releasing Enzyme Activity 

(iii) NUMBER OF SEQUENCES: 154 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORRISON & FOERSTER 

(B) STREET: 755 PAGE MILL ROAD 
<C) CITY: Palo Alto 

<D> STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304-1018 

(V) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
<B) COMPUTER: IBM Compatible 
<C) OPERATING SYSTEM: Windows 
<D) SOFTWARE: FastSEQ for Uindows Version 2.0b 



(vi> CURRENT APPLICATION DATA: 
CA) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: USSN 09/081,385 
<B) FILING DATE: 014-NOV-1998 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: 

<B) REGISTRATION NUMBER: 

<C> REFERENCE/DOCKET NUMBER: 22000-20577.21 

Cix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 650-813-5600 
<B) TELEFAX: 650-494-0792 
<C> TELEX: 706141 



C2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

AAGCTTTTTG CTTTCCTTCC CCGGGAAAGG CCGGGGCCAG AGACCCGCAC TCGGACCAGG 60 

CGGGGGCTGC GGGGCCAGAG TGGGCTGGGG AGGGCTGGGA GGGCGTCTGG GGCCGGCTCC 120 

TCCAGGCTGG GGGCCGCCAG CTCCGGGAAG GCAGTCCTGG CCTGCGGATG GGGCCGCGCG 180 

TGGGGCCCGG CGGGGCGGCC TCGGGAGGCG TCCAGGCTGC GGGAGCGGGA GGAGCGGCCG 240 

TGCGGGCGCC AGCGCCGTGG GTGGAGGTCG CCGTCCCTCC TGAGGGGCAG CCAGTGCGTT 300 

TGGGACCCGG GAGCAGAGCC CGCGCCTCCC CAGCGGCCTC CCCGGGGGTC TCACCGGGTC 360 

ACCCGAGAGC GGAGGCCCCG GCTCCGCAGA AACCCGGGGC GGCCGCGGGG AAGCAGCGCC 420 

CTCAGGCGTC GGAGGAGCCC CCAGAAGGAC CTCGCGCCTT CCCGCCGGGC TCCGACCGCC 480 

TGGGTTCGGT GCGGGACGGC CCAGGCCGCC AGGACCCCCA AGCGCAGCTC AGTCTGCGGG 540 

GCACGACCCA GAGGCCAGCA GCAGAGGACG GGGCCGGGGC CGGGAGAGGG CGGGGAGGGC 600 

GCTCCTGGGA GGTCAAGGCC AGGGCTAGAC TTTCAGGGTC ATGGCCTGGC CCCTCATCCC 660 

CAGGGAGGTG AGGGGGCTCT GTGAGCAGAG GGGGCCCCGG TGGAGAAGGC GCTGCTAGCC 720 
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AGGGGCGGGG CAGGAGCCCA GGTGGGGACT TAAGGGTGGC TGAAGGGACC CTCAGGCTGC 780 

AGGGATAGGG AGGGAAGCTA GGGGTGTGGC TTGGGGAGGT GCTGGGGGAC CGCGGGCGCC 840 

CTTTATTCTG AAGCCGAATG TGCTGCCGGA GTCCCCAGTG ACCTAGAAAT CCATTTCAAG 900 

ATTTTCAGGA GTTTCAGGTG GAGACAAAGG CCAGGCCCAG GTGAAAATGT GGCAGTGACA 960 

GAGTATGGGG TGAGAACCAC GGAGAGAGGA AGTCCCCGAG GCGGATGATG GGACAGAGAG 1O20 

CGGGGACCAG AATTTTTTAA AACGCATCTG AGATGCGTTT GGCAGACTCA TAGTTGTTTT 1080 

CCTTTCACGG AGAAAGTGTG GGCAGAAGCC AGCTCTAAAG CCCAGGCTGC CCAGCCTGCA 1140 

CTGGCAGAGC TGACGGAAGG CCAGGGCAGA GCCTTCCCTC CCTGTCACAG ACATGAGCCC 1200 

TGGAGATCTG GMTGAGGCA GATGTGCCCA GGGAAAGCTG ATCCGCCCCG ACCCAGGGCC 1260 

CCCCGGGTGC CCCTTTGAGC GTGGAATCGT TGCCAGGTCA TGGCTCCCTG CTATCGAACA 1320 

CCGGACACGG GTCGTGTGCT GCACCTGGCA GTTGCAGGAC CGACACCCAC AATGCCTTAA 1380 

GAGGTGATGA CTGCCTTCCA GGGGCCTGGC TGGCTGACAC TTTGCATGGC TCCTGGAGAA H40 

GAGGGATTGA GTGGAGTCCA CGGGTCATGG CCACGTCCTG GGTGCTGCCT CTGAGGCAGG 1500 

GCCCGGCTGG GGTGAGAAGG GGCTGGAGAC AGGTTCCTGC CAGTTCAGCC TCTAACCGGT 1560 

GGTCTTCATG CCTAGGAACC CACTGGGGGC TTATGAAACT GCAGGTGGCT GAGTCCTTGC 1620 

CATGGGGTCT CTCCTTCAGG AGGTCTGGGT GGGGCCGGAG ACTGTACCCC ACAAAGGGTC 1680 

CCAGGTGAGG CGGATGTGGC CTGGCGCTGT GTGGCTCTGG ACCTAGTCCT TGGGCTTGGG 1740 

CTGGCGCCCA GGGCCTGGGC TTGAGACAGC TGTGACGCAG GCAAGCCATT TACCCCGTTT 1800 

GTGGGGACAT TACATCTTCC TAGCTTGGAA CACACAGGCA GCCAGGGTTG TTATCCACAT 1860 

TCCTCCTCCA TGTTCTTCTC TTGAGAACTT TTACCAGGTA TGTCAGGAGC TGGGCTCCAC 1920 

CAGGGAGACT CAAGTGGAAA GCCCTCATCC TTGTCCTCCA GGAGACAGGA AAACCTATGG 1980 

TTACAATTCC AGGGACAAGA GCGATGCATG TGAGGTGTGG CAAATCTCAC TGTTCAACTG 2040 

GAGAAATCAG AGACAGCTTC CTGGAGGCAG TGACACCTGG ACAGGCTTCT CCACAGGAGG 2100 

AAGCGAGTGA GAGAAGCCAA CTGGGATGGA CCCATCATGT AGGGGGAACA GTGCGCGCAG 2160 

AACCAACAAC CACCCCCACC CTAGGCCCAG AGCTCACGGA GAGAGCTGGG CCTCTCGGGG 2220 

TGACTACATA GTTCCCTGCT GGATCTTAGG TCTTGTCCTT GGGCAGCTCT GCTGAGACCT 2280 

CTATGCCTGT TCCAGGCTGC ACCAAGGTTT TGTGACTATT GGTCTGGGGT TGTTTTGCAG 2340 

CAACTGAAGT GTTCTGTTGT AAMCAGGCA CTTGATTTGC TGGAAGGAAT GCTGTTTGTT 2400 

CTTGCTGCGA CAAACATTGA GCAGCATTTA GTGGGCGGTT TATATC7TGT GGAGTAATGG 2460 

GTGTTTTTGA AGTCTGTCCT GGGTACTGCA CATTAAAAGG AATATCATTT TCTGAAACAT 2520 

TGCTATTTTC CACACCAGAA ATCATATCCT CTTGCTGGTC CATGTCTGAA GACCTTACAC 2580 

GAGAAAGTCT TAATGTAAGT TTAGTAGAGT CCTTGGATGG AGAACTAATT ATATCATACA 2640 

TTGCCGCTTT CTCACTCTGC TCTTTTTCAT CCTTGCCTAA TTTCATTTTC TTCTGCTTCT 2700 

TTTGTTTTCT TTCTGGAGAA TCTAGCAAGA TATCTGGTGG AACATCTCGA GGTGATGAAC 2760 

AAGGTAGAGA CTGAGATTGT AGGATTAAAG GTGGTCTTGA GCCTTTAGGA GTTCCTTCAC 2820 

TTCCAGCAGG, GGAGCATACT GGCTGTGGAG ATCTCAAGGG AAAAGATGCA GCATTCCTCA 2880 

TTGTTGAAGA ATCTCCATCG TCACTACTTA GCCTGTGCAC CATGTGTAGG TAGTCCTCAC 2940 

TTGAACCATG TCTAGGATTA TCAGCATGAT GATTAGCTGA ATTGCCAGAC AACGGACCAG 3000 

AAACTTTATT ATCATGTATG TTTCTCAAAC CACCTGCAAC AATGGGACTT GAIACCGATG 3060 

CTTGTTGCAT CTGTGGATGT GTTGTGTAAC TTGAAGGATG GGAATATGGC ATGTATCCTG 3120 

CAGGGCTTTG TGGGGCGTAT GGACTAGGCA CTGGGCTATT TTGCTGTGGC ATAAATCTGT 3180 

TCCCAGAGCT TGTCTGTGGT GGCACAAACC GGCTGGAGGG GCTATGTGAG ATAGTGGTTT 3240 

GTTGATAATT GGAAGATGCA GGACTACTGT GCATGGAATT CTGAGAAAGT TTATACTGAG 3300 

ACATCATCAT TCCACTTTGT ACATATCTGT TCTGCATGCT TTTCTCCCTG AAAACATTAG 3360 

GACTCCTTGC CAGGACGGCC TGCAACAAGA CTGGTATGTC ACCTTCTGGG TCATCACTGC 3420 

CAAGGTTATC TTTCAACTCT ATGTGATCTG TTGATACCTG GTTGAGGCTA TGGACAAGCT 3480 

GTGAAACCAA ATTGTCATCC CTACAAGCCA AAAGGCAGTT CACCTCTTCT GCTATTCGTG 3540 

CATTAAAGAG AAGGCTCTTT GTAGTTGTAG CAGGTAAAGG AGATGGAAGA GGCAGCTGGT 3600 

TCAGGAGGTC TGTGAGACTA GCAATCCCCG CAAGAGTAGT AATGGGGACA TGGGGCATAT 3660 

CCCCATTCAT CCTGAATTTC TGGAATGGTG TTGCCTATAA AAGTACTTAG TTCAGGTGCC 3720 

AGCTGTCATT ACTTCCCATT TCCCAAACAC TGGGCGAATC GGCGTCTGAA TCCAAGGGGA 3780 

GGCCGAGGCC GCTGTGGCGA GAGACTATAA TCCGGGCCGG GAGGGGGGGC GGCTACGGCT 3840 

CCTCTTCCGT CTCCTCAGTG CGGGGAACAT GTAGAGCCGG GGGGAGACCA GCCGAGAAGA 3900 

CAAATCGTTG CTTCTTCTTC CTCCTCCTCC TCCTTCTCCC ACATAGAAAC ACTCACAAAC 3960. 

ACCCGACCAC GGGCCCGAGC TACCGGGGGG GCATCGCCGC GGGCCCGGGA ACCAATTCTC 4020 
CTGTCGGCGG GGGCGTCCTT TGGATCC 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 739 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

GGATCCAAAG GTCAAACTCC CCACCTGGCA CTGTCCCCGG AGCGGGTCGC GCCCGGCCGG 
CGCGCGGCCG GGCGCTTGGC GCCAGAAGCG AGAGCCCCTC GGGGCTCGCC CCCCCGCCTC 
ACCGGGTCAG TGAAAAAACG ATCAGAGTAG TGGTATTTCA CCGGCGGCCC GCAGGGCCGG 
CGGACCCCGC CCCGGGCCCC TCGCGGGGAC ACCGGGGGGG CGCCGGGGGC CTCCCACTTA 
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TTCTACACCT CTCATGTCTC TTCACCGTGC CAGACTAGAG TCAAGCTCAA CAGGGTCTTC 300 

TTTCCCCGCT GATTCCGCCA AGCCCGTTCC CTTGGCTGTG GTTTCGCTGG ATAGTAGGTA 360 

GGGACAGTGG GAATCTCGTT CATCCATTCA TGCGCGTCAC TAATTAGATG ACGAGGCATT 420 

TGGCTACCTT AAGAGAGTCA TAGTTACTCC CGCCGTTTAC CCGCGCTTCA TTGAATTTCT 480 

TCACTTTGAC ATTCAGAGCA CTGGGCAGAA ATCACATCGC GTCAACACCC GCCGCGGGCC 540 

TTCGCGATGC TTTGTTTTAA TTAAACAGTC GGATTCCCCT GGTCCGCACC AGTTCTAAGT 600 

CGGCTGCTAG GCGCCGGCCG AAGCGAGGCG CCGCGCGGAA CCGCGGCCCC CGGGGCGGAC 660 

CCGCGGGGGG GACCGGGCCG CGGCCCCTCC GCCGCCTGCC GCCGCCGCCG CCGCCGCGCG 720 

CCGAAGAAGA AGGGGGAAA 739 

(2) INFORMATION FOR SEQ ID NO:3: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 233 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAAGAGTGGC GGCCGCAGCA GGCCCCCCGG GTGCCCGGGC CCCCCTCGAG GGGGACAGTG 60 

CCCCCGCCGC GGGGGCCCCG CGGCGGGCCG CCGCCGGCCC CTGCCGCCCC GACCCTTCTC 120 

CCCCCGCCGC CGCCCCCACG CGGCGCTCCC CCGGGGAGGG GGGAGGACGG GGAGCGGGGG 130 

AGAGAGAGAG AGAGAGAGGG CGCGGGGTGG CTCGTGCCGA ATTCAAAAAG CTT 233 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GGATCCAAAG AATTCGGCAC GAGGTAGTCA CGGCTCTTGT CATTGTTGTA CTTGACGTTG 60 

AGGCTGGTGA GCTTGGAAAA GTCGATGCGC AGCGTGCAGC AGGCGTTGTA GATGTTCTGC 120 

CCGTCCAGCG ACAGCTTGGC GTGCTGGGCG CTCACGGGGT CCGCATACTG CAGCAGGGCC 180 

TGGAACTGGT TGTTCTTGGT GAAGGTGATG ATCTTCAACA CTGTGCCGAA CTTGGAGAAA 240 

ATCTGGTGCA GCACATCCAG GGTCACAGGG TAGAAGAGGT TCTCCACGAT GATCCTGAGC 300 

ACGGGGCTCT GCCCGGCCAT CGCCATCCCT GCATCCACGG CCGCCGCCGA GGCAGCCAAG 360 

GCCAGGTTCC CCGACTGGAC CGAGTTCACC GCCTGCAGGG CCGCCTGGGC CCGCGCCTGG 420 

TTGGGAGAGC TGTCGGTCTT CAGCTCCTTG TGGTTGGAGA ACTGGATGTA GATGGGCTGG 480 

CCGCGCAGCA CAGGGGTCAC CGAGGTGTAG TAGTTCACCA f GGTATTGGC AGCCTCCTCC 540 

GTGTTCATCT CGATGAAGGC CTGGTTTTTC CCCTTCAGCA TCAGGAGGTT GGTGACCTTC 600 

CCAAAGGGCA GCCCCAGGGA GATGACTTCC CCCTCCGTGA CGTCGATGGG GAGCTTCCGG 660 

ATGTGGATCA CTCTAGAGGG GACGCCTGCA CTTCGGCTGT CACCTTTGAA CTTCTTGCTG 720 

TCATTTCCGT TTGCTGCAGA AGCCGAGTTG CTGCTCATGA TAAACGGTCC GTTAGTGACA 780 

CAAGTAGAGA AAAGCTCGTC AGATCCCCGC TTTGTACCAA CGGCTATATC TGGGACAATG 840 

CCGTCCATGG CACACAGAGC AGACCCGCGG GGGACGGAGT GGAGGCGCCG GAATCCTGGA 900. 

GCTAGAGCTG CAGATTGAGT TGCTGCGTGA GACGAAGCGC AAGTATGAGA GTGTCCTGCA 960 

GCTGGGCCGG GCACTGACAG CCCACCTCTA CAGCCTGCTG CAGACCCAGC ATGCACTGGG 1020 

TGATGCCTTT GCTGACCTCA GCCAGAAGTC CCCAGAGCTT CAGGAGGAAT TTGGCTACAA 1080 

TGCAGAGACA CAGAAACTAC TATGCAAGAA TGGGGAAACG CTGCTAGGAG CCGTGAACTT 1140 

CTTTGTCTCT AGCATCAACA CATTGGTCAC CAAGACCATG GAAGACACGC TCATGACTGT 1200 

GAAACAGTAT GAGGCTGCCA GGCTGGAATA TGATGCCTAC CGAACAGACT TAGAGGAGCT 1260 

GAGTCTAGGC CCCCGGGATG CAGGGACACG TGGTCGACTT GAGAGTGCCC AGGCCACTTT 1320 

CCAGGCCCAT CGGGACAAGT ATGAGAAGCT GCGGGGAGAT GTGGCCATCA AGCTCAAGTT 1380 

CCTGGAAGAA AACAAGATCA AGGTGATGCA CAAGCAGCTG CTGCTCTTCC ACAATGCTGT 1440 

GTCCGCCTAC TTTGCTGGGA ACCAGAAACA GCTGGAGCAG ACCCTGCAGC AGTTCAACAT 1500 

CAAGCTGCGG CCTCCAGGAG CTGAGAAACC CTCCTGGCTA GAGGAGCAGT GAGCTGCTCC 1560 

CAGCCCAACT TGGCTATCAA GAAAGACATT GGGAAGGGCA GCCCCAGGGT GTGGGAGATT 1620 

GGACATGGTA CATCCTTTGT CACTTGCCCT CTGGCTTGGG CTCCTTTTTC TGGCTGGGGC 1680 

CTGACACCAG TTTTGCCCAC ATTGCTATGG TGGGAAGAGG GCCTGGAGGC CCAGAAGTTG 1740 

CTGCCCTGTC TATCTTCCTG GCCACAGGGC TTCATTCCCA GATCTTTTCC TTCCACTTCA 1800 

CAGCCAACGG CTATGACAAA ACCACTCCCT GGCCAATGGC ATCACTCTTC AGGCTGGGGT 1860 

GTGCTCCCTG ACCAATGACA GAGCCTGAAA ATGCCCTGTC AGCCAATGGC AGCTCTTCTC 1920 

GGACTCCCCT GGGCCAATGA TGTTGCGTCT AATACCCTTT GTCTCTCCTC TATGCGTGCC 1980 

CATTGCAGAG AAGGGGACTG GGACCAAAGG GGTGGGGATA ATGGGGAGCC CCATTGCTGG 2040 
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CCTTGCATCT GAATAGGCCT ACCCTCACCA TTTATTCACT AATACATTTT ATTTGTGTTC 2100 

TCTAATTTAA AATTACCTTT TCATCTTGCT TGATTTTCCT TCAGCTAAAT TAGAAATTTG 2160 

TAGTTTTTCC CCTAAAAAAT TCAATGGCAT TCTTTCTTAT AAATTACATT CTCTGATTTT 2220 

CTTGTCAGCC TGCTTCAAGG AAATCCATGT GTTCAAAATG CTTGCTCGCA GTTTGCTCCA 2280 

TACCAMTGG TTGCTTAACC CAAATATCTG AGCAGCAAAT TGAGCTGATC CTTCTGGAGA 2340 

MGTACGGTT GAACAGCCAA GACCACTGGG TAGTCGAAGA GAAGACCACA CATCCTGAAC 2400 

TCCCCAGTCT GGTGTGAGGG GAGGACAGCT GATMCTGGA TATGCAGTGT TCCCAGACAT 2460 

CACTGGTCCC AAACCATTAC TTCTGCCTGC CACTGCCACA AATACAGTAG GAATGCCATC 2520 

CCCTTCATAC TCAGCTTTAA TCCTCAGAGT TTCATCTGGT CCTTTATGCG CAGATGTTAC 2580 

TCGAAGTTCA CATGGAATGC CAAAATTTCC ACAGGCCTTC TTGATTTTTT CACAGTGACC 2640 

AAGATCAGAA GTAGAGCCCA TCAACACTAC AACCCTGCAC TGACTTTCTG ATTTCAAAAG 2700 

CAACTCTACT CTCTCTGCAA CCCACTCAAA GTTTTTCTTT ACCATTTGGA GCCCTTCAGG 2760 

AGTTACTTCT TTGAGGTCCC GATMGACTG TTTGTCTTTC TGTTGGCTTC GATCTCCTGA 2820 

TGGCCAGAGT CTCCAGGAAT CATTGTCAAT AACATCAGCA AGAACAATTT CTTTGGTGGT 2880 

TACATCAACA CCAAATTCAA TCTTCATATC AACCAGTGTA CAATTCTGGG GCAACCAGGA 2940 

TTTCTCCAGT ATTTCAAATA TAGCCTGTGT AGCATCTCGT GCCGAATTCA AAAAGCTT 2998 

(2) INFORMATION FOR SEQ ID NO: 5: 

{i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4152 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 

<ii> MOLECULE TYPE: Genomic DMA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

AAGCTTTTTG TGAAAACCCT AGGATATGTC CCCTCCCTCA CCACACCCAA CCCCCCGCCC 60 

CTGCCCCAGG ACATGACGAT GCCTCACACA CACACACACA CACACATACA CACAAGGCCG 120 

TGAGCTGCAC GCAGGAACAT GGGCTGCACT CACGACAACA TTGAAAAAAT ATACATTATA 180 

TATGTACACC CGGGGCCCCC ACGTCCCCTC CCGTCCCCGC AGCCTGGCCA CACCAGGTCA 240 

CGGAGGAGGG GCCGGGGCTG CAGGACCTCA GGACTGCAAG GGCAGGAAGG GAAACAGGAC 300 

AAGAAAGGAA GGAAGTTGGA AAGGAGGGAG AAATGGGGTC CCCAGACTGA AATGGAAATG 360 

AGGTGGGGCG ATCATAAGAG AAGCAGGGAC GATGGTCCAG CTGAGGGAGC CCTGCAGAGG 420 

GGGAAAAGCT TCCCATGGAC AGGAGAGAGA AGGGAAGGGG AGAGGAGAGG GTTTCCTTCA 480 

ATCCCACCCC CAGCCCCAGC CCCAGCCCCA GCCATTGCAA TCGTCACCCT CTCCCCAACA 540 

CAGTGAGTGC TAAGGGGGCA GCTGCCATTG GGGGTAGAAA GGCAGCTGAA GTCCAGCCCA 600 

CTTTCCAACC CAGCCAGCCC CAGTGCAAGG GGCACACCAG GAGCATGACA GCCCAGAAGT 660 

GAGGGATGGG GGGCCGGGGG AGGGGCAGGG CGGACTCCAG AGGGCCCGCT GGGGTTTTGA 720 

AATGAAAGGA GGACTGGTTC TGAAGCCTCT CTCCCTCTTG GTCTCTGTGT TCCCAGAAAG 780 

TCCTTCTCCC ATGTCTGGAG TGTCTGTTTC ACCAGGGCAG AATTCCCCCT CTGCGTGGGG 840 

AGAGGTGTAG GCCTTAGTAG CGGTGTGGGG GGGTCTCGAT GATGCGTCTC TCGTCGCTGC 900 

TGGGGGAATC GGCCACCTCC GAGTCACTGC TGTCCTCATC CTCCTGCTGG CCCCCAACAG 960 

CCCCCGTCAC ACAGGACTGC CGATTCTGGT AGGACTCCAT GGGGTTCACA ATGATGGTGA 1020 

GAGCTGAGTC ATCCCAGAAG AGGTCTGGGT CCTTGGGGTC ACTGGAGGCC CCTGGAGGCC 1080 

CGCCGGCCCC TGAGACGCGG CGGTGAAGGG AATGGATGCG CACCAGGCCC AGGACGACCA 1140 

TGAGCACCAG GAAGCCCACG CACACCACAA TGATGAGGGT TGCGGCGCTG GGTATCATGG 1200 

AGTTTCTGTG GGAGCTGGCT AGGCTGTGTC CAGCCATCTC AGGCGGGGGC TGGTGACCAC 1260 

GGTGCAGGAA CTGCTGGGAG CTGAGCACGT GGCTGGGGTG GGCAACCCGG TTCATGCTGT 1320 

GCAGGACATT GACCTCCACG ATGAATTCAT TGCTGGAGTA ACGGCCATTC ATTTCCGAGC 1380 

AGGAAAGCCG GAACTTCCTG GTGTAGAGGG CAGCTCCGTG TCGCAGCCGA TAACGAGCCT 1440 

GCCTCAGGAT CTCTTCATAC ACAGTGATGC TCTCCACCCC AGCAATAGTG AGGTAGGCAG 1500. 

ATGTGTTGGT GAGCTCCAGC CCCCGCTGCT GCAGAGAGGT TGTGTCCAGG AGCAGGCTTT 1560 

CCCGCTCGGG ATCCAGGTCA TCCCCCACCA GAGAAATTTC ACAGCCATCC AGGTTGTGCA 1620 

CAATCTCATC CGACATGCGT GTGTCTGTCA CTGTGCCCTG CCAACTCTCA TCCTTTTTGG 1680 

CCTCCACCTG GTGAGAAATG GAGCAGGTGA TTTGAAGATC AGGGAACAAA GGGACGCCGT 1740 

TGGTTCCCTC AAAGTCCACA GCTGGGCGGG CAAAATGAGC AGTGCCACTC AGCAGGATCT 1800 

GGGGGGCGTC AGGCTGAAGG ACGACCACGT AGCCCTCCAC TTCAGGGATG GAGACGCAGG 1860 

ACTCTTCGCT GAAGCACTTG ACAGCAGTGG TGAGGCGCAG GGGCCTGACG CCGGGCGTGG 1920 

CAAAGCGCAG AGTGTTCATG TAAGCCACAT GCTGCAGGGC ATGGTTGAAG GTCTCCACAT 1930 

CATCCCCCTC CAGGGTGAGC AGGGACTGTG AGGGGTTCAC GTGGACCTTC ATGCCTTTGC 2040 

CCAGGCTCTC GAAATCCCTA TAGTCCAGCC CCTCCCGACA TGCATAGAGG CACTCGATGA 2100 

CCTCGCGGCT CTCCAGGCGA CCTGAGCGCA CGCTGAAACC AGCCAGGTAG CCATGGAAGT 2160 

AGTGGTGGAT CGACAAAGGG TCTCCTTGGG TGGTGTCTGT ACTGTTGTCT CCCTTTTCCT 2220 

TCTCTTTGTT CTTCTCCTCA GTCCAGCAGG CCCCAATCAT GAGAGCAGGC TCCCTTCGGG 2280 

GTGGGTGGAT GAGGCCATTG TCATGGATGA GGGCAGGGTC GAAGGAGATG CCGTCGGTAT 2340 

AGAGTGTGAC TGTGGGGAAC TCGAGGTTCA GAGCGTAGTG GTGCCACTCA TCATCACAGA 2400 

CCTGCTCCAG CTTCCAGAGG AACTTGACTG GGCGGGCACT CTCAAGCAGG GGCCAGTAGA 2460 

GGAAGGCAAT CCTACAGCCG TGGACAGTCA GCGAGTAGTG AGAGAAGCCG TCCTCATTCT 2520 

GGACAGTGTT ACATACGATG GTTTCCTCTT CCTTCTTGCC CTTGTTGGGA GTTACGCCAT 2580 

GCTTCATCCA GAAGGACAGG GTGAAGTGGT CACTGAGGCT GTCCTGGGGC CCAGAGCCCA 2640 
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GCCCACTGGG GCCACCCAGG GGCACCTGCA CAGCCTGGGT GCCATTGAAC CAGTAGATCA 2700 

GGCTGCTGTC CTGGCTGTAG TGCACCGAGA GTCCTGCTGT CCAGTTGGCA TTGGGGCCAG 2760 

GCATGGGCAA CAGATCCACT TCCCCAGTGG CAGCACCACA GAGTTTCCGC AGCGCCCGCT 2820 

CTGAGTAGTT GTCACGGTCA CAGCCCTTGG CCACATGGCT GGTCTGCAGC TCTATGGTGG 2880 

CCTGAATGTT CCAGAGTGGT TCATCACAGG TCTCCAGGCG GATACCAGGG AACAAAGCCA 2940 

AGCTCCCAGC ACCTGGTGCA TATTCGATCC TTTTGTTCCA GCCTTGCCAG CTGGGTTTAC 3000 

AGGTGGGCTT CACCTGAATC TCCACCTCAG CATCATCTGC TGCCCGCTTC TTCCCACAGT 3060 

CATAAGCTGT CACTGTAAAC TTATAGAGCC TCTCACCACT GTACTGCAGC TTCTCTGTGT 3120 

TCTCAATGTT CCCGTCATTG TCAATGAGGA AAGGGGTGTT GGGTGTGAGA ATCTCATAGT 3180 

AGCAGATCTG GCTGTACTGG GGGGAGCAGT CACCGTCAAT GGCTTCCACC CGCAGGATGC 3240 

GATCGTACAG CTTCCCCTCT GTCACAGCCG CACGATACAG CCGTTCCACA AACACTGGGG 3300 

CAAACTCGTT CACATCGTTG ACCCGCACAT GCACAGTGGC CTTGTGGGAC TTCTTGGTGT 3360 

TGGCCCCGTC GGGGCCCTCG CCACAGTCAT AGGCCTGGAT GGTGAAGGTG TGTTCCTTCT 3420 

GGGCCTCGCA GTCCACAGGC TCCTTGGCCC GGATCAGCCC CTCTCCTGTC GCCTTGTCAA 3480 

GGATCACAGC CTCAAAGGGC ACCCCAGACC CATGGAGCCG GAAGCCGCAG ATCTCACCTG 3540 

CATAGCGCAG CGGGGCATCC TTGTCCAAGG CAAAGAGTGG TGGATTCAGT AGGACCGTGT 3600 

TGTCATTCTC CATGACGATG CCCTGGTACT CTGCCTCAAT CCATGGCTTG TGCTTGTTGG 3660 

CTTTGTTACA GGAGCAGGAC GCGAGCAGAG AGGCCAGCAG AAGGGGCAGC AGCAGGAGGG 3720 

TCATGGTGCG GCGTGGGGCA GGGCAGGGCC AGGCGTTTGC CTCCCCTGGG AGCCTCCAGC 3780 

CTGCGGATTC CACCTTGCGG GAGGGATACA GGGGGGGAAA ACCAAAATAA AACGTCAAAT 3840 

AAATTGTGTA GGAGGAGTCC AGCTTAGGAC CGGGCCAGAG CCAGGCCAGG CTCGGGGAGG 3900 

GGGCCTCTGC AGGTTCAGAG GATCACTGCT GCCACCACCG CCACCCTGGG AGCCAGTTAT 3960 

TTTGCCATGG CCTTGATTGC AACAGCTGCC TCCTCTGTCA TGGCAGACAG CACCGTGATC 4020 

AGGATCTCTT CTCCACAGTC GTACTTCTGC TCAATCTCCT TGCCMGGTC TCCCTCAGGG 4080 

AGACGAAGGT CCTCTCGTAC CTCCCCGCTG TCCTGGAGCA GTGATAGGTA CCCATCCTGG 4140 

ATCTTTGGAT CC 4152 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3117 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DMA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

GGATCCAAAG ATTCGGCACG AGTGGCCACA TCATGAACCT CCAGGCCCAG CCCAAGGCTC 60 

AGAACAAGCG GAAGCGTTGC CTCTTTGGGG GCCAGGAACC AGCTCCCAAG GAGCAGCCCC 120 

CTCCCCTGCA GCCCCCCCAG CAGTCCATCA GAGTGAAGGA GGAGCAGTAC CTCGGGCACG 180 

AGGGTCCAGG AGGGGCAGTC TCCACCTCTC AGCCTGTGGA ACTGCCCCCT CCTAGCAGCC 240 

TGGCCCTGCT GAACTCTGTG GTGTATGGGC CTGAGCGGAC CTCAGCAGCC ATGCTGTCCC 300 

AGCAGGTGGC CTCAGTAAAG TGGCCCAACT CTGTGATGGC TCCAGGGCGG GGCCCGGAGC 360 

GTGGAGGAGG TGGGGGTGTC AGTGACAGCA GCTGGCAGCA GCAGCCAGGC CAGCCTCCAC 420 

CCCATTCAAC ATGGAACTGC CACAGTCTGT CCCTCTACAG TGCAACCAAG GGGAGCCCGC 480 

ATCCTGGAGT GGGAGTCCCG ACTTACTATA ACCACCCTGA GGCACTGAAG CGGGAGAAAG 540 

CGGGGGGCCC ACAGCTGGAC CGCTATGTGC GACCAATGAT GCCACAGAAG GTGCAGCTGG 600 

AGGTAGGGCG GCCCCAGGCA CCCCTGAATT CTTTCCACGC AGCCAAGAAA CCCCCAAACC 660 

AGTCACTGCC CCTGCAACCC TTCCAGCTGG CATTCGGCCA CCAGGTGAAC CGGCAGGTCT 720 

TCCGGCAGGG CCCACCGCCC CCAAACCCGG TGGCTGCCTT CCCTCCACAG AAGCAGCAGC 780 

AGCAGCAGCA ACCACAGCAG CAGCAGCAGC AGCAGCAGGC AGCCCTACCC CAGATGCCGC 840 

TCTTTGAGAA CTTCTATTCC ATGCCACAGC AACCCTCGCA GCAACCCCAG GACTTTGGCC 900. 

TGCAGCCAGC TGGGCCACTG GGACAGTCCC ACCTGGCTCA CCACAGCATG GCACCCTACC 960 

CCTTCCCCCC CAACCCAGAT ATGAACCCAG AACTGCGCAA GGCCCTTCTG CAGGACTCAG 1020 

CCCCGCAGCC AGCGCTACCT CAGGTCCAGA TCCCCTTCCC CCGCCGCTCC CGCCGCCTCT 1080 

CTAAGGAGGG TATCCTGCCT CCCAGCGCCC TGGATGGGGC TGGCACCCAG CCTGGGCAGG 1140 

AGGCCACTGG CAACCTGTTC CTACATCACT GGCCCCTGCA GCAGCCGCCA CCTGGCTCCC 1200 

TGGGGCAGCC CCATCCTGAA GCTCTGGGAT TCCCGCTGGA GCTGAGGGAG TCGCAGCTAC 1260 

TGCCTGATGG GGAGAGACTA GCACCCAATG GCCGGGAGCG AGAGGCTCCT GCCATGGGCA 1320 

GCGAGGAGGG CATGAGGGCA GTGAGCACAG GGGACTGTGG GCAGGTGCTA CGGGGCGGAG 1380 

TGATCCAGAG CACGCGACGG AGGCGCCGGG CATCCCAGGA GGCCAATTTG CTGACCCTGG 1440 

CCCAGAAGGC TGTGGAGCTG GCCTCACTGC AGAATGCAAA GGATGGCAGT GGTTCTGAAG 1500 

AGAAGCGGAA AAGTGTATTG GCCTCAACTA CCAAGTGTGG GGTGGAGTTT TCTGAGCCTT 1560 

CCTTAGCCAC CAAGCGAGCA CGAGAAGACA GTGGGATGGT ACCCCTCATC ATCCCAGTGT 1620 

CTGTGCCTGT GCGAACTGTG GACCCAACTG AGGCAGCCCA GGCTGGAGGT CTTGATGAGG 1680 

ACGGGAAGGG TCTTGAACAG AACCCTGCTG AGCACAAGCC ATCAGTCATC GTCACCCGCA 1740 

GGCGGTCCAC CCGAATCCCC GGGACAGATG CTCAAGCTCA GGCGGAGGAC ATGAATGTCA 1800 

AGTTGGAGGG GGAGCCTTCC GTGCGGAAAC CAAAGCAGCG GCCCAGGCCC GAGCCCCTCA 1860 

TCATCCCCAC CAAGGCGGGC ACTTTCATCG CCCCTCCCGT CTACTCCAAC ATCACCCCAT 1920 

ACCAGAGCCA CCTGCGCTCT CCCGTGCGCC TAGCTGACCA CCCCTCTGAG CGGAGCTTTG 1980 

AGCTACCTCC CTACACGCCG CCCCCCATCC TCAGCCCTGT GCGGGAAGGC TCTGGCCTCT 2040 
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ACTTCAATGC CATCATATCA ACCAGCACCA TCCCTGCCCC TCCTCCCATC ACGCCTMGA 2100 

GTGCCCATCG CACGCTGCTC CGGACTAACA GTGCTGAAGT AACCCCGCCT GTCCTCTCTG 2160 

TGATGGGGGA GGCCACCCCA GTGAGCATCG AGCCACGGAT CAACGTGGGC TCCCGGTTCC 2220 

AGGCAGAAAT CCCCTTGATG AGGGACCGTG CCCTGGCAGC TGCAGATCCC CACAAGGCTG 2280 

ACTTGGTGTG GCAGCCATGG GAGGACCTAG AGAGCAGCCG GGAGAAGCAG AGGCAAGTGG 2340 

AAGACCTGCT GACAGCCGCC TGCTCCAGCA TTTTCCCTGG TGCTGGCACC AACCAGGAGC 2400 

TGGCCCTGCA CTGTCTGCAC GAATCCAGAG GAGACATCCT GGAAACGCTG AATAAGCTGC 2460 

TGCTGAAGAA GCCCCTGCGG CCCCACAACC ATCCGCTGGC AACTTATCAC TACACAGGCT 2520 

CTGACCAGTG GAAGATGGCC GAGAGGAAGC TGTTCAACAA AGGCATTGCC ATCTACAAGA 2580 

AGGATTTCTT CCTGGTGCAG AAGCTGATCC AGACCAAGAC CGTGGCCCAG TGCGTGGAGT 2640 

TCTACTACAC CTACAAGAAG CAGGTGAAAA TCGGCCGCAA TGGGACTCTA ACCTTTGGGG 2700 

ATGTGGATAC GAGCGATGAG AAGTCGGCCC AGGAAGAGGT TGAAGTGGAT ATTAAGACTT 2760 

CCCAAAAGTT CCCAAGGGTG CCTCTTCCCA GAAGAGAGTC CCCAAGTGAA GAGAGGCTGG 2820 

AGCCCAAGAG GGAGGTGAAG GAGCCCAGGA AGGAGGGGGA GGAGGAGGTG CCAGAGATCC 2880 

AAGAGAAGGA GGAGCAGGAA GAGGGGCGAG AGCGCAGCAG GCGGGCAGCG GCAGTCAAAG 2940 

CCACGCAGAC ACTACAGGCC AATGAGTCGG CCAGTGACAT CCTCATCCTC CGGAGCCACG 3000 

AGTCCAACGC CCCTGGGTCT GCCGGTGGCC AGGCCTCGGA GAAGCCAAGG GAAGGGACAG 3060 

GGAAGTCACG AAGGGCACTA CCTTTTTCAG AAAAAAAAAA AAAAAAACAA AAAGCTT 3117 

<2> INFORMATION FOR SEQ ID NO:7: 

O") SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GAATTCGGCA CGAGGTCAGT TTCCTGTGGA ACACAGAGGC TGCCTGTCCC ATTCAGACAA 60 

CGACGGATAC AGACCAGGCT TGCTCTATAA GGGATCCCAA CAGTGGATTT GTGTTTAATC 120 

TTAATCCGCT AAACAGTTCG CAAGGATATA ACGTCTCTGG CATTGGGAAG ATTTTTATGT 180 

TTAATGTCTG CGGCACAATG CCTGTCTGTG GGACCATCCT GGGAAAACCT GCTTCTGGCT 240 

GTGAGGCAGA AACCCAAACT GAAGAGCTCA AGAATTGGAA GCCAGCAAGG CCAGTCGGAA 300 

TTGAGAAAAG CCTCCAGCTG TCCACAGAGG GCTTCATCAC TCTGACCTAC AAAGGGCCTC 360 

TCTCTGCCAA AGGTACCGCT GATGCTTTTA TCGTCCGCTT TGTTTGCAAT GATGATGTTT 420 

ACTCAGGGCC CCTCAAATTC CTGCATCAAG ATATCGACTC TGGGCAAGGG ATCCGAAACA 480 

CTTACTTTGA GTTTGAAACC GCGTTGGCCT GTGTTCCTTC TCCAGTGGAC TGCCAAGTCA 540 

CCGACCTGGC TGGAAATGAG TACGACCTGA CTGGCCTAAG CACAGTCAGG AAACCTTGGA 600 

CGGCTGTTGA CACCTCTGTC GATGGGAGAA AGAGGACTTT CTATTTGAGC GTTTGCAATC 660 

CTCTCCCTTA CATTCCTGGA TGCCAGGGCA GCGCAGTGGG GTCTTGCTTA GTGTCAGAAG 720 

GCAATAGCTG GAATCTGGGT GTGGTGCAGA TGAGTCCCCA AGCCGCGGCG AATGGATCTT 780 

TGAGCATCAT GTATGTCAAC GGTGACAAGT GTGGGAACCA GCGCTTCTCC ACCAGGATCA 840 

CGTTTGAGTG TGCTCAGATA TCGGGCTCAC CAGCATTTCA GCTTCAGGAT GGTTGTGAGT 900 

ACGTGTTTAT CTGGAGAACT GTGGAAGCCT GTCCCGTTGT CAGAGTGGAA GGGGACAACT 960 

GTGAGGTGAA AGACCCAAGG CATGGCAACT TGTATGACCT GAAGCCCCTG GGCCTCAACG 1020 

ACACCATCGT GAGCGCTGGC GAATACACTT ATTACTTCCG GGTCTGTGGG AAGCTTTCCT 1080 

CAGACGTCTG CCCCACAAGT GACAAGTCCA AGGTGGTCTC CTCATGTCAG GAAAAGCGGG 1140 

AACCGCAGGG ATTTCACAAA GTGGCAGGTC TCCTGACTCA GAAGCTAACT TATGAAAATG 1200 

GCTTGTTAAA AATGAACTTC ACGGGGGGGG ACACTTGCCA TAAGGTTTAT CAGCGCTCCA 1260 

CAGCCATCTT CTTCTACTGT GACCGCGGCA CCCAGCGGCC .AGTATTTCTA AAGGAGACTT 1320 

CAGATTGTTC CTACTTGTTT GAGTGGCGAA CGCAGTATGC CTGCCCACCT TTCGATCTGA 1380. 

CTGAATGTTC ATTCAAAGAT GGGGCTGGCA ACTCCTTCGA CCTCTCGTCC CTGTCAAGGT 1440 

ACAGTGACAA CTGGGAAGCC ATCACTGGGA CGGGGGACCC GGAGCACTAC CTCATCAATG 1500 

TCTGCAAGTC TCTGGCCCCG CAGGCTGGCA CTGAGCCGTG CCCTCCAGAA GCAGCCGCGT 1560 

GTCTGCTGGG TGGCT CCAAG CCCGTGAACC TCGGCAGGGT AAGGGACGGA CCTCAGTGGA 1620 

GAGATGGCAT AATTGTCCTG AAATACGTTG ATGGCGACTT ATGTCCAGAT GGGATTCGGA 1680 

AAAAGTCAAC CACCATCCGA TTCACCTGCA GCGAGAGCCA AGTGAACTCC AGGCCCATGT 1740 

TCATCAGCGC CGTGGAGGAC TGTGAGTACA CCTTTGCCTG GCCCACAGCC ACAGCCTGTC 1800 

CCATGAAGAG CAACGAGCAT GATGACTGCC AGGTCACCAA CCCAAGCACA GGACACCTGT 1860 

TTGATCTGAG CTCCTTAAGT GGCAGGGCGG GATTCACAGC TGCTTACAGC GAGAAGGGGT 1920 

TGGTTTACAT GAGCATCTGT GGGGAGAATG AAAACTGCCC TCCTGGCGTG GGGGCCTGCT 1980 

TTGGACAGAC CAGGATTAGC GTGGGCAAGG CCAACAAGAG GCTGAGATAC GTGGACCAGG 2040 

TCCTGCAGCT GGT6TACAAG GATGGGTCCC CTTGTCCCTC CAAATCCGGC CTGAGCTATA 2100 

AGAGTGTGAT CAGTTTCGTG TGCAGGCCTG AGGCCGGGCC AACCAATAGG CCCATGCTCA 2160 

TCTCCCTGGA CAAGCAGACA TGCACTCTCT TCTTCTCCTG GCACACGCCG CTGGCCTGCG 2220 

AGCAAGCGAC CGAATGTTCC GTGAGGAATG GAAGCTCTAT TGTTGACTTG TCTCCCCTTA 2280 

TTCATCGCAC TGGTGGTTAT GAGGCTTATG ATGAGAGTGA GGATGATGCC TCCGATACCA 2340 

ACCCTGATTT CTACATCAAT ATTTGTCAGC CACTAAATCC CATGCACGGA GTGCCCTGTC 2400 

CTGCCGGAGC CGCTGTGTGC AAAGTTCCTA TTGATGGTCC CCCCATAGAT ATCGGCCGGG 2460 

TAGCAGGACC ACCAATACTC AATCCAATAG CAAATGAGAT TTACTTGAAT TTTGAAAGCA 2520 
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GTACTCCTTG CCAGGAATTC AGTTGTAAAT AAAATTGAAC CTGCTCAACA GCTGAGGGAG 2580 

ACTAGAAATG ATGGGTCCAT ATCCTGGTGC ATTGTCATAC AATTCAAACA ATGGTGCAGC 2640 

TACCAGCTTG TAATTTTTAG GGACTGCAAA CAAGGCTTTT TCTTGAAGCT GAACCAGAAA 2700 

CAACTTCTTA TGTTCCTTA6 GCTTTGTAAT ATGTGCAGGA ATATATGGAT ACTGAGGAGG 2760 

TTCAAAATTT GGTCTCCACC AGTTACCAAT GCAATCGTCA ATGACCCAGT CTTGCAAAAC 2820 

TCCATCCTGA CGACCCAGTA TCTCTGTCAT TAAGCGTTTT AGTCCTTCAA CTTCATCTTC 2880 

TCCTGGGTTA AGTTCACCAC CAGGTAGTTT GAAGAAAGTT GTTCCCAGCT GCAGCAGTAA 2940 

CACATGGGGT AGCCGGTGCT CATGTACAAT CAGAACCCCT TCTACAGTCC TCCTCATTCC 3000 

AATTTTATCA AATTCTTCCC TCATGCGCTG AAATCTGGCT GCAACAGAGC TGTCCTTCTC 3060 

GTAGAGGGGC TCTTTTGTAC CAAAAGTATA ATTGGTAAGA GGGTACAGGT TGATGGTGCG 3120 

CTCCAGGGTG AGGGGCTTCG TCTGCTGGAT GTACTTGTTG CCGAACTGAG TGACCCCCCG 3180 

GGGCCAGCCG GTCTGCGAGC GATTGGGCGG TACCACAGAC ATGCTGGCGA GCTCCGGCGC 3240 

TGACGGCGAG CAGAAAGTGG CAGGCAGGGT AGACTTTCCC CGTGCGGGAA GCCTCGTGCC 3300 



(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

GAATTCGGCA CGAGAATGGA TCAACCTCAA CAACACGTTA AAGCTAGACG AAAGAAGTAA 60 

TACACAGTGT ATGAGTCTCA CATGAAATAC CCGGATGTAA ATCCAAAGAA ACAGGAAGCA 120 

GATTGGTGGT TGCCAGGGAC AAGGGCGGTG GGAGGAGAAA ATGGAGAGTA ACGGGACTTT 180 

ACTTTTGGAG TGATGAGAAT GTTTTGGAGC TAGATAGAAG TGGTGGTTGT ACACCATTGT 240 

GGATGTACTA CCACTTAATT GTTCACTTAA AAAGTTAATT TATGTGAATT GCATCTTAAT 300 

TAAAAACAAG GATAACATTC CAACTCCTGG ACATTATCCT TCCTTTCCAT TTGATGTCAG 360 

GCCCGTGTTA GAATTCTCAT CCGGTTTGGT CACTGCACTT AAGATGTGGA GAAATTAGGA 420 

CGCACAGTTA AGAGGAAGGA TAACACTGAT TAAGGTAGTG CTTTTCTAGG TTTCCCCTAA 480 

ACAATTTAAC AGATGGATAG TGGCACCACT TACGAGATGG AAAAACCAGC GGAAGGAAGA 540 

TTTGGGGGAG AAGTTAAGTT TGTCTTGGGC CTGTGTTTTG CAACCTGAGT GTAAAAGACA 600 

TATGTTAAGT CTTCAGTGGC GAAACACTAA AACTAGAAAT GGATCAGAAT TTTATCTTTG 660 

GATGTGACTT CTCAAGGATG GTCTTGTCAC TTCAGTGCCT GGTCAAATGA CAAGATGGGC 720 

AATCTTTTCC TGAAGGTCCA AGCACCTGAA CGTGGCAGGG TGACCCGATT CCGATTTGCT 780 

TAGAACAATC CTAGTTCATG CCTATTGTCC CTCATGTAAT TAATATCACT CTCAAAATGT 840 

CTCATTTTGT GCAATAAATT CTGCAACGTG ATGGCGCGAC TCTCGCGGCC CGAGCGGCCG 900 

GACCTTGTCT TCGAGGAAGA GGACCTCCCC TATGAGGAGG AAATCATGCG GAACCAATTC 960 

TCTGTCAAAT GCTGGCTTCA CTACATCGAG TTCAAACAGG GCGCCCCGAA GCCCAGGCTC 1020 

AATCAGCTAT ACGAGCGGGC ACTCAAGCTG CTGCCCTGCA GCTACAAACT CTGGTACCGA 1080 

TACCTGAAGG CGCGTCGGGC ACAGGTGAAG CATCGCTGTG TGACCGACCC TGCCTATGAA 1140 

GATGTCAACA AC7GTCATGA GAGGGCCTTT GTGTTCATGC ACAAGATGCC TCGTCTGTGG 1200 

CTAGATTACT GCCAGTTCCT CATGGACCAG GGGCGCGTCA CACACACCCG CCGCACCTTC 1260 

GACCGTGCCC TCCGGGCACT GCCCATCACG CAGCACTCTC GAATTTGGCC CCTGTATCTG 1320 

CGCTTCCTGC GCTCACACCC ACTGCCTGAG ACAGCTGTGC GAGGCTATCG GCGCTTCCTC 1380 

AAGCTGAGTC CTGAGAGTGC AGAGGAGTAC ATTGAGTACC TCAAGTCAAG TGACCGGCTG 1440 

GATGAGGCCG CCCAGCGCCT GGCCACCGTG GTGAACGACG AGCGTTTCGT GTCTAAGGCC 1500 

GGCAAGTCCA ACTACCAGCT GTGGCACGAG CTGTGCGACC TCATCTCCCA GAATCCGGAC 1560 

AAGGTACAGT CCCTCAATGT GGACGCCATC ATCCGCGGGG GCCTCACCCG CTTCACCGAC 1620. 

CAGCTGGGCA AGCTCTGGTG TTCTCTCGCC GACTACTACA TCCGCAGCGG CCATTTCGAG 1680 

AAGGCTCGGG ACGTGTACGA GGAGGCCATC CGGACAGTGA TGACCGTGCG GGACTTCACA 1740 

CAGGTGTTTG ACAGCTACGC CCAGTTCGAG GAGAGCATGA TCGCTGCAAA GATGGAGACC 1800 

GCCTCGGAGC TGGGGCGCGA GGAGGAGGAT GATGTGGACC TGGAGCTGCG CCTGGCCCGC 1860 

TTCGAGCAGC TCATCAGCCG GCGGCCCCTG CTCCTCAACA GCGTCTTGCT GCGCCAAAAC 1920 

CCACACCACG TGCACGAGTG GCACAAGCGT GTCGCCCTGC ACCAGGGCCG CCCCCGGGAG 1980 

ATCATCAACA CCTACACAGA GGCTGTGCAG ACGGTGGACC CCTTCAAGGC CACAGGCAAG 2040 

CCCCACACTC TGTGGGTGGC GTTTGCCAAG TTTTATGAGG ACAACGGACA GCTGGACGAT 2100 

GCCCGTGTCA TCCTGGAGAA GGCCACCAAG GTGAACTTCA AGCAGGTGGA TGACCTGGCA 2160 

AGCGTGTGGT GTCAGTGCGG AGAGCTGGAG CTCCGACACG AGAACTACGA TGAGGCCTTG 2220 

CGGCTGCTGC GAAAGGCCAC GGCGCTGCCT GCCCGCCGGG CCGAGTACTT TGATGGTTCA 2280 

GAGCCCGTGC AGAACCGCGT GTACAAGTCA CTGAAGGTCT GGTCCATGCT CGCCGACCTG 2340 

GAGGAGAGCC TCGGCACCTT CCAGTCCACC AAGGCCGTGT ACGACCGCAT CCTGGACCTG 2400 

CGTATCGCAA CACCCCAGAT CGTCATCAAC TATGCCATGT TCCTGGAGGA GCACAAGTAC 2460 

TTCGAGGAGA GCTTCAAGGC GTACGAGCGC GGCATCTCGC TGTTCAAGTG GCCCAACGTG 2520 

TCCGACATCT GGAGCACCTA CCTGACCAAA TTCATTGCCC GCTATGGGGG CCGCAAGCTG 2580 

GAGCGGGCAC GGGACCTGTT TGAACAGGCT CTGGACGGCT GCCCCCCAAA ATATGCCAAG 2640 

ACCTTGTACC TGCTGTACGC ACAGCTGGAG GAGGAGTGGG GCCTGGCCCG GCATGCCATG 2700 

GCCGTGTACG AGCGTGCCAC CAGGGCCGTG GAGCCCGCCC AGCAGTATGA CATGTTCAAC 2760 

— 54 — 



ATCTACATCA AGCGGGCGGC CGAGATCTAT GGGGTCACCC ACACCCGCGG CATCTACCAG 2820 

AAGGCCATTG AGGTGCTGTC GGACGAGCAC GCGCGTGAGA TGTGCCTGCG GTTTGCAGAC 2880 

ATGGAGTGCA AGCTCGGGGA GATTGACCGC GCCCGGGCCA TCTACAGCTT CTGCTCCCAG 2940 

ATCTGTGACC CCCGGACGAC CGGCGCGTTC TGGCAGACGT GGAAGGACTT TGAGGTCCGG 3000 

CATGGCAATG AGGACACCAT CAAGGAAATG CTGCGTATCC GGCGCAGCGT GCAGGCCACG 3060 

TACAACACGC AGGTCAACTT CATGGCCTCG CAGATGCTCA AGGTCTCGGG CAGTGCCACG 3120 

GGCACCGTGT CTGACCTGGC CCCTGGGCAG AGTGGCATGG ACGACATGAA GCTGCTGGAA 3180 

CAGCGGGCAG AGCAGCTGGC GGCTGAGGCG GAGCGTGACC AGCCCTTGCG CGCCCAGAGC 3240 

AAGATCCTGT TCGTGAGGAG TGACGCCTCC CGGGAGGAGC TGGCAGAGCT GGCACAGCAG 3300 

GTCAACCCCG AGGAGATCCA GCTGGGCGAG GACGAGGACG AGGACGAGAT GGACCTGGAG 3360 

CCCAACGAGG TTCGGCTGGA GCAGCAGAGC GTGCCAGCCG CAGTGTTTGG GAGCCTGAAG 3420 

GAAGACTGAC CCGTCCCCTC GTGCCGAATT CGGCACGAGC AAGACCAGCC CCCAGATCAT 3480 

TTGCCTCAAA GGTTTTCCCT CGAAGTCACA AATGT7TCAA GGAATCTCAA ATTTTACAAA 3540 

GTTTGAAGTG TGGGCATTGG TGGCCTGTGG CTGTGTCCTC TCTCTGTAGC TGTTTTCTCC 3600 

CTACATCCCT GAAAGGAAGT TGAGCCTGCT CCTCCATCCG CAGACCTCCC TTTCCAGCGC 3660 

CCAGGGCATG GGGTGCTGTG AGGGCAGCAT GCTAGGTGTG ACCGTGCTCC TGGCCTCCAG 3720 

GCCCGTGTCC CTCTGTCCTC TAGCCCACTA AGGCCCTGGC CCATTTGTGC TAAACAGGCA 3780 

GTCGGACCTA GAAAGAGCAG ACAATCTCTC TGGGTCACCA GTCTGGCTAG GAGCTGGTCT 3840 

CCTGACTGGG ATCCAGGCCT TCTCCCCTGC CCATGTGAAT TCCCAGGGGC AGAGCCTGAA 3900 

ATGTTGAACA CAGCACTGGC CAAAGAGATG TCACCGTGGG AACCGAGGCT CTCTTCTCCT 3960 

CCTGCCTGCT TTCGTGGGTT CAGAGTAGCT GAGGCTTGTC TGAGAGGAGT TGGAGTGCTG 4020 

GTTTTCACCC TGGTTGGTGT GCTTTGCTTT GAGGGCACTT AGAAAGCCCA GCCCAGCCCT 4080 

TGCTCCTGCC CTGCACACAG CGGAGCGACT TTTCTAGGTA TGCTCTTGAT TTCTGCAGAA 4140 

GCAGCAGGTG GCATGGAGCC AAGAGGAAGT GTGACTGAAA CTGTCCACTC ATAGCCCGGC 4200 

TGCCGTATTG AGAGGGCT 4218 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS: double 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Genomic DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

GAGCTCGCGC GCCTGCAGGT CGACACTAGT GGATCCAAAG AATTCGGCAC GAGGGAAACT 60 

CAACGGTGTA CGAGTGGAGG ACAGGGACAG AGCCCTCTGT GGTGGAACGA CCCCACCTCG 120 

AGGAGCTTCC TGAGCAGGTG GCAGAAGATG CGATTGACTG GGGCGACTTT GGGGTAGAGG 180 

CAGTGTCTGA GGGGACTGAC TCTGGCATCT CTGCCGAGGC TGCTGGAATC GACTGGGGCA 240 

TCTTCCCGGA ATCAGATTCA AAGGATCCTG GAGGTGATGG GATAGACTGG GGAGACGATG 300 

CTGTTGCTTT GCAGATCACA GTGCTGGAAG CAGGAACCCA GGCTCCAGAA GGTGTTGCCA 360 

GGGGCCCAGA TGCCCTGACA CTGCTTGAAT ACACTGAGAC CCGGAATCAG TTCCTTGATG 420 

AGCTCATGGA GCTTGAGATC TTCTTAGCCC AGAGAGCAGT GGAGTTGAGT GAGGAGGCAG 480 

ATGTCCTGTC TGTGAGCCAG TTCCAGCTGG CTCCAGCCAT CCTGCAGGGC CAGACCAAAG 540 

AGAAGATGGT TACCATGGTG TCAGTGCTGG AGGATCTGAT TGGCAAGCTT ACCAGTCTTC 600 

AGCTGCAACA CCTGTTTATG ATCCTGGCCT CACCAAGGTA TGTGGACCGA GTGACTGAAT 660 

TCCTCCAGCA AAAGCTGAAG CAGTCCCAGC TGCTGGCTTT GAAGAAAGAG CTGATGGTGC 720 

AGAAGCAGCA GGAGGCACTT GAGGAGCAGG CGGCTCTGGA GCCTAAGCTG GACCTGCTAC 780 

TGGAGAAGAC CAAGGAGCTG CAGAAGCTGA TTGAAGCTGA CATCTCCAAG AGGTACAGCG 840 

GGCGCCCTGT GAACCTGATG GGAACCTCTC TGTGACACCC TCCGTGTTCT TGCCTGCCCA 900 

TCTTCTCCGC TTTTGGGATG AAGATGATAG CCAGGGCTGT TGTTTTGGGG CCCTTCAAGG 960. 

CAAAAGACCA GGCTGACTGG AAGATGGAAA GCCACAGGAA GGAAGCGGCA CCTGATGGTG 1020 

ATCTTGGCAC TCTCCATGTT CTCTACAAGA AGCTGTGGTG ATTGGCCCTG TGGTCTATCA 1080 

GGCGAAAACC ACAGATTCTC CTTCTAGTTA GTATAGCGCA AAAAGCTTCT CGAGAGTACT 1140 

TCTAGAGCGG CCGCGGGCCC ATCGATTTTC CACCCGGGTG GGGTACC 1187 

C2) INFORMATION FOR SEQ ID NO:10: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3306 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: double 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Genomic DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCCTCACTAA AGGGAACAAA AGCTGGAGCT CGCGCGCCTG CAGGTCGACA CTAGTGGATC 60 

GAAAGTTCGT TACGCCAAGC TCGAAATTAA CTCTGGGCTG ACCCATAAAC ATTTGTCTGA 120 
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TCTAGGATAT AGT7GCGTTT CTTGCGGGCA GCAATCTGGA TGAGGCGGTT GAGGCACTGG 180 

GTGGCCTGCT GGATCAGGAC ATCCCAGCGG CCAGCATAGT TCCGCTGCCG GCGTAGGCCC 240 

ATCACCCGCA TCTTATCCAT GATGGCATTG GTACCCAGGA TGTTGTACTT CTTGGAAGGG 300 

TTGGAGGCTG CATGTTTGAT GGCCCATGTG GTCTTGCCAG CAGCAGGCAG GCCCACCATC 360 

ATCAGAATCT CACATTCTGC CTTGCTCTTT GGTCCAACGG TGCCCCGGAT ACGCTCACTA 420 

AGGGGAAGGT GCTGGATGAA GGTAAACCCC GGGAGGACAG AACAGTAGGG CTCTGCTCTC 480 

TGTCCGAAGT TGAACTCCAC TGCGCAATTC TTCACCAGGA CATGAGGATA GAGGGCCTGA 540 

CCCCCCAAGG CTTCCTTCTG GATTCGGAAA GCAATGCCCA TCCACTTTCC ATTCTTGGTA 600 

AAAGACAGTT CCACGTCATT TCCACATTCA AAATCCGCAA AGCAGCCAAT CACCGGAGAG 660 

CTCTSCGGTG CTAGGAGAGC GGCTGGGCCC GCAGACTGGG GGGAAAGCTC CGCAGCCGCA 720 

GTGGGCCCCA GGATCAGGCC CCGCGTGGCC TGGAGAAGCC CAGTCTGGGC TGGAGCGGGA 780 

GCTGGACAGT GTGGCCTTGC GTTCGCCCCC GGGAGCGCTG CGAGTGTCGC GGCCTCGGGT 840 

GGATTTGCTG AGCACCAATA CCTCACGGTT GCCAACCTGG GGTTTTAGCT CCCTTGGTTT 900 

TAATCCCCTA GGGGCGGGTG GGGGCACGGG AGGAAGGATG GGCCAGCTGG GTGCAATCCT 960 

GCTGTAAGCC AGCCATTCCT TGATTTCTTA GAATTAACTA AACGGTCGCG CCGGAGGCCG 1020 

CGGGGGCCGG AGCGGAGCAG CCGCGGCTGA GGTTCCCGAG TCGGCCGCTC GGGGCTGCGC 1080 

TCCGCCGCCG GGACCCCGGC CTCTGGCCGC GCCGGCTCCG GCCTCCGGGG GGGCCGGGGC 1140 

CGCCGGGACA TGGTGCCAGT CGCACCCCTT CCCCGCCGCC GCTGAGCTCG CCGGCCGCGC 1200 

CCGGGCTGGG ACGTCCGAGC GGGAAGATGT TTTCCGCCCT GAAGAAGCTG GTGGGGTCGG 1260 

ACCAGGCCCC GGGCCGGGAC AAGAACATCC CCGCCGGGCT GCAGTCCATG AACCAGGCGT 1320 

TGCAGAGGCG CTTCGCCAAG GGGGTGCAGT ACAACATGAA GATAGTGATC CGGGGAGACA 1380 

GGAACACGGG CAAGACAGCG CTGTGGCACC GCCTGCAGGG CCGGCCGTTC GTGGAGGAGT 1440 

ACATCCCCAC ACAGGAGATC CAGGTCACCA GCATCCACTG GAGCTACAAG ACCACGGATG 1500 

ACATCGTGAA GGTTGAAGTC TGGGATGTAG TAGACAAAGG AAAATGCAAA AAGCGAGGCG 1560 

ACGGCTTAAA GATGGAGAAC GACCCCCAGG AGNCGGAGTC TGAAATGGCC CTGGATGCTG 1620 

AGTTCCTGGA CGTGTACAAG AACTGCAACG GGGTGGTCAT GATGTTCGAC ATTACCAAGC 1680 

AGTGGACCTT CAATTACATT CTCCGGGAGC TTCCAAAAGT GCCCACCCAC GTGCCAGTGT 1740 

GCGTGCTGGG GAACTACCGG GACATGGGCG AGCACCGAGT CATCCTGCCG GACGACGTGC 1800 

GTGACTTCAT CGACAACCTG GACAGACCTC CAGGTTCCTC CTACTTCCGC TATGCTGAGT 1860 

CTTCCATGAA GAACAGCTTC GGCCTAAAGT ACCTTCATAA GTTCTTCAAT ATCCCATTTT 1920 

TGCAGCTTCA GAGGGAGACG CTGTTGCGGC AGCTGGAGAC GAACCAGCTG GACATGGACG 1980 

CCACGCTGGA GGAGCTGTCG GTGCAGCAGG AGACGGAGGA CCAGAACTAC GGCATCTTCC 2040 

TGGAGATGAT GGAGGCTCGC AGCCGTGGCC ATGCGTCCCC ACTGGCGGCC AACGGGCAGA 2100 

GCCCATCCCC GGGCTCCCAG TCACCAGTCC TGCCTGCACC CGCTGTGTCC ACGGGGAGCT 2160 

CCAGCCCCGG CACACCCCAG CCCGCCCCAC AGCTGCCCCT CAATGCTGCC CCACCATCCT 2220 

CTGTGCCCCC TGTACCACCC TCAGAGGCCC TGCCCCCACC TGCGTGCCCC TCAGCCCCCG 2280 

CCCCACGGCG CAGCATCATC TCTAGGCTGT TTGGGACGTC ACCTGCCACC GAGGCAGCCC 2340 

CTCCACCTCC AGAGCCAGTC CCGGCCGCAC AGGGCCCAGC AACGGTCCAG AGTGTGGAGG 2400 

ACTTTGTTCC TGACGACCGC CTGGACCGCA GCTTCCTGGA AGACACAACC CCCGCCAGGG 2460 

ACGAGAAGAA GGTGGGGGCC AAGGCTGCCC AGCAGGACAG TGACAGTGAT GGGGAGGCCC 2520 

TGGGCGGCAA CCCGATGGTG GCAGGGTTCC AGGACGATGT GGACCTCGAA GACCAGCCAC 2580 

GTGGGAGTCC CCCGCTGCCT GCAGGCCCCG TCCCCAGTCA AGACATCACT CTTTCGAGTG 2640 

AGGAGGAAGC AGAAGTGGCA GCTCCCACAA AAGGCCCTGC CCCAGCTCCC CAGCAGTGCT 2700 

CAGAGCCAGA GACCAAGTGG TCCTCCATAC CAGCTTCGAA GCCACGGAGG GGGACAGCTC 2760 

CCACGAGGAC CGCA6CACCC CCCTGGCCAG GCGGTGTCTC TGTTCGCACA GGTCCGGAGA 2820 

AGCGCAGCAG CACCAGGCCC CCTGCTGAGA TGGAGCCGGG GAAGGGTGAG CAGGCCTCCT 2880 

CGTCGGAGAG TGACCCCGAG GGACCCATTG CTGCACAAAT GCTGTCCTTC GTCATGGATG 2940 

ACCCCGACTT TGAGAGCGAG GGATCAGACA CACAGCGCAG GGCGGATGAC TTTCCCGTGC 3000 

GAGATGACCC CTCCGATGTG ACTGACGAGG ATGAGGGCCC TGCCGAGCCG CCCCCACCCC 3060 

CCAAGCTCCC TCTCCCCGCC TTCAGACTGA AGAATGACTC GGACCTCTTC GGGCTGGGGC 3120 

TGGAGGAGGC CGGACCCAAG GAGAGCAGTG AGGAAGGTAA GGAGGGCAAA ACCCCCTCTA 3180 

AGGAGAAGAA AAAAAAAACA AAAAGCTTCT CGAGAG7ACT TCTAGAGCGG CCGCGGGCCC 3240 

ATCGATTTTC CACCCGGGTG GGGTACCAGG TAAGTGTACC CAATTCGCCC TATAGTGAGT 3300 

CGTATT 330fr 

(2) INFORMATION FOR SEQ ID NO:11: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 
TGCGGGGCCA GAGTGGGCTG 

(23 INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCAGTCCTGG CCTGCGGATG 

(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 
CO STRAND EDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GTCGACAGGA GAATTGGTTC 

C2> INFORMATION FOR SEQ ID NO:H: 

Ci> SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
CC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCCTGGGTTC GGTGCGGGAC 

C2) INFORMATION FOR SEQ ID NO:15: 

Ci) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
TGGTCGGGTG TTTGTGAGTG 

C2) INFORMATION FOR SEQ ID NO:16: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
CCTCTTCCGT CTCCTCAGTG 

C2) INFORMATION FOR SEQ ID NO: 17: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GGATTGCTAG TCTCACAGAC 



(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

TTAAGGGTGG CTGAAGGGAC 

(2) INFORMATION FOR SEQ ID N0:19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ACCTTCCCTC CCTGTCACAG 

(2) INFORMATION FOR SEQ ID NO:20: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
TGGTCGGGTG TTTGTGAGTG 

C2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

ACACCATTCC AGAAATTCAG 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 
AAACTGCAGG TGGCTGAGTC 

(2) INFORMATION FOR SEQ ID NO:23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



GTCCTAATGT TTTCAGGGAG 20 

(2) INFORMATION FOR SEQ ID N0:24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(8) TYPE: nucleic acid 
( C ) ' ST RAND EDNESS : single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

AAAACCTATG GTTACAATTC 20 

<2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TCCTAGACAT GGTTCAAGTG 20 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GATATAATTA GTTCTCCATC 20 
(2) INFORMATION FOR SEQ ID N0:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATGCCTGTTC CAGGCTGCAC 20 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GGACGGCGAC CTCCACCCAC 

(2) INFORMATION FOR SEQ ID NO:29: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 
GGGCTCCTCC GACGCCTGAG 

(2) INFORMATION FOR SEQ ID N0:30: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
CC) STRAND EDNESS: single 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 
AGTCTAGCCC TGGCCTTGAC 

(2) INFORMATION FOR SEQ ID NO-.31: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS: single 
<D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 

GTCACTGGGG ACTCCGGCAG 

(2) INFORMATION FOR SEQ ID N0:32: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CAGCTTTCCC TGGGCACATG 

(2) INFORMATION FOR SEQ ID N0:33: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 
CACAGCTGTC TCAAGCCCAG 

C2) INFORMATION FOR SEQ ID N0:34: 

<i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 



ACTGTTCCCC CTACATGATG 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
ATCATATCCT CTTGCTGGTC 

(2) INFORMATION FOR SEQ ID NO:36: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GTTCCCAGAG CTTGTCTGTG 

(2) INFORMATION FOR SEQ ID NO:37: 

(?) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GTTTGGCAGA CTCATAGTTG 

(2) INFORMATION FOR SEQ ID N0:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TAGCAGGGAG CCATGACCTG 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
CTTGGCGCCA GAAGCGAGAG 

(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CCTCTCTCTC TCTCTCTCTC 

(2) INFORMATION FOR SEQ ID NO:41 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

TCCCCGCTGA TTCCGCCAAG 

(2) INFORMATION FOR SEQ ID N0:42: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:42: 
CTTTTTGAAT TCGGCACGAG 

(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:43: 
CCCCTGGTCC GCACCAGTTC 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GAGAAGGGTC GGGGCGGCAG 

(2) INFORMATION FOR SEQ ID NO:45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
AAATCACATC GCGTCAACAC 

(2) INFORMATION FOR SEQ ID N0:46: 
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(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TAAGAGAGTC ATAGTTACTC 20 
(2) INFORMATION FOR SEQ ID NO:47: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GCTCTAGAAG TACTCTCGAG 20 
(2) INFORMATION FOR SEQ ID N0:48: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
ACTCTGGCCA TCAGGAGATC 20 
(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:A9: 
CAGGCGTTGT AGATGTTCTG 20 
(2) INFORMATION FOR SEQ ID N0:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
AGTGGCAGGC AGAAGTAATG 20 
(2) INFORMATION FOR SEQ ID N0:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

GGTTGGAGAA CTGGATGTAG 

(2) INFORMATION FOR SEQ ID NO:52: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CTATTCAGAT GCAACGCCAG 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
CCATGGCACA CAGAGCAGAC 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
GCTACCATGC AGAGACACAG 

(2) INFORMATION FOR SEQ ID NO:55: 

O') SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 
CAGGCTGACA AGAAAATCAG 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
GGCACGCATA GAGGAGAGAC 

(2) INFORMATION FOR SEQ ID NO:57: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

TGGGTGATGC CTTTGCTGAC 

(2) INFORMATION FOR SEQ ID NO:58: 

Ci) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 
<8> TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

AAAACAAGAT CAAGGTGATG 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:59: 
TTGCCCACAT TGCTATGGTG 

(2) INFORMATION FOR SEQ ID NO:60: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID M0:60: 
GACCAAGATC AGAAGTAGAG 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID N0:61: 
CCCCTGGGCC AATGATGTTG 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



TCTTCCCACC ATAGCAATG 
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(2) INFORMATION FOR SEQ ID N0:63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:63: 
TGGTCTTGGT GACCAATGTG 

(2) INFORMATION FOR SEQ ID NO.-64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:64: 

ACACCTCGGT GACCCCTGTG 

(2) INFORMATION FOR SEQ ID NO.-65: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 
<B> TYPE: nucleic acid 
CO STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

TCTCCAAGTT CGGCACAGTG 

(2) INFORMATION FOR SEQ ID N0:66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
ACATGGGCTG CACTCACGAC 

(2) INFORMATION FOR SEQ ID N0:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
GATCCTCTGA ACCTGCAGAG 

(2) INFORMATION FOR SEQ ID N0-.68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
GGAAATGAGG TGGGGCGATC 20 
(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69: 
CTTTGCCTTG GACAAGGATG 20 
(2) INFORMATION FOR SEQ ID N0:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
GCACCTGCCA TTGGGGGTAG 20 
(2) INFORMATION FOR SEQ ID N0:71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:71 : 

GGTGGAAGCC ATTGACGGTG 20 

C2> INFORMATION FOR SEQ ID NO:72: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:72: 
TGCGTCTCTC GTCGCTGCTG 20 
(2) INFORMATION FOR SEQ ID N0:73: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0.-73: 
GCGGAAACTC TGTGGTGCTG 

(2) INFORMATION FOR SEQ ID NO-.74: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
AGGATTGCCT TCCTCTACTG 

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:75: 

TGTCTGTTTC ACCAGGGCAG 

(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:76: 
CCAGTGCCTC TATGCATGTC 

(2) INFORMATION FOR SEQ ID N0:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 



(X?) SEQUENCE DESCRIPTION: SEQ ID N0:77: 
AGGAAGCCCA CGCACACCAC 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:78: 
CCCTTTGTTC CCTGATCTTC 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:79: 
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CGCTCGGGAT CCAGGTCATC 



(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
TCGAGGTTCA GAGCGTAGTG 

(2) INFORMATION FOR SEQ ID N0:81: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<X1> SEQUENCE DESCRIPTION: SEQ ID NO-.81: 
TCTTGGATCT CTGGCACCTC 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

CCATCAGAGT GAAGGAGGAG 

(2) INFORMATION FOR SEQ ID N0:83: 

Ci) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

CCATCTTCCA CTGGTCAGAG 

(2) INFORMATION FOR SEQ ID N0:84: 

<i> SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
CTCCTTCTCT TGGATCTCTG 

(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 
TTACTTCAGC ACTGTTAGTC 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

AGGGAGGTAG CTCAAAGCTC 

(2> INFORMATION FOR SEQ ID N0:87: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID N0:87: 

TGGGTCCACA GTTCGCACAG 

(2) INFORMATION FOR SEQ ID N0:88: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
CAACTCTGTG ATGGCTCCAG 

<2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



CXI") SEQUENCE DESCRIPTION: SEQ ID NO:89: 
AGCAGGGTTC TGTTCAAGAC 

(2) INFORMATION FOR SEQ ID NO:90: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
CCATTGGGTG CTAGTCTCTC 
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(2) INFORMATION FOR SEQ ID N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:91: 

CAGCCATGCT GTCCCAGCAG 

C2) INFORMATION FOR SEQ ID NO:92: 

Ci) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO:92: 
CTGGACCTGA GGTAGCGCTG 

(2) INFORMATION FOR SEQ ID N0:93: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

ATAACCACCC TGAGGCACTG 

(2) INFORMATION FOR SEQ ID NO:94: 

<i> SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 
CCTGCAGGTC GACACTAGTG 

(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
<D> TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
AATTGGAATG AGGAGGACTG 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:96: 
GCTCTAGAAG TACTCTCGAG 

(2) INFORMATION FOR SEQ ID N0:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:97: 
ATTGTATGAC AATGCACCAG 

(2) INFORMATION FOR SEQ ID N0:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:98: 
TCCACAGAGG GCTTCATCAC 

(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 
CCTGACTGGC CTAAGCACAG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
AAGCCTCATA ACCACCAGTG 

(2) INFORMATION FOR SEQ ID NO: 101: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
TGTCAACGGT GACAAGTGTG 

(2) INFORMATION FOR SEQ ID N0:102: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
TTGTACACCA GCTGCAGGTC 20 
(2) INFORMATION FOR SEQ ID N0:103: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GGGTGTGGTG CAGATGAGTC 20 

(2) INFORMATION FOR SEQ ID NO:104: 

<i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

ATCACACTCT TATAGCTCAG 20 

(2) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 105: 
GTGGGAAGCT TTCCTCAGAC 20 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
TGATGAACAT GGGCCTGGAG 20 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
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CATTGTGGAT GTACTACCAC 

(2) INFORMATION FOR SEQ ID NO: 108: 

Ci) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

TGTGTTTTGC AACCTGAGTG 

(2) INFORMATION FOR SEQ ID NO:109: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
CB) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

ATAGTGGCAC CACTTACGAG 

(2) INFORMATION FOR SEQ ID NO:110: 

Ci) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
AATTCTGCAA CGTGATGGCG 

C2) INFORMATION FOR SEQ ID NO: 111: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:11l: 

CACAAGATGC CTCGTCTGTG 

C2> INFORMATION FOR SEQ ID NO: 112: 

Ci) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:112: 
AATCCGGACA AGGTACAGTC 

(2) INFORMATION FOR SEQ ID NO:113: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 20 base pairs 

CB) TYPE: nucleic acid 

CC) STRANDEDNESS: single 
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CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 
GCACGAGTGG CACAAGCGTG 

(2) INFORMATION FOR SEQ ID NO:1K: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 
GCAAGCGTGT GGTGTCAGTG 

(2) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:115: 
TGTTTGAACA GGCTCTGGAC 

(2) INFORMATION FOR SEQ ID NO:116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID M0:116: 
CGGCATGGCA ATGAGGACAC 

(2) INFORMATION FOR SEQ ID NO:117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:117: 
AGGACGAGAT GGACCTCCAG 

(2) INFORMATION FOR SEQ ID N0:118: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:118: 

CCCTCTGTCC TCTAGCCCAC 

(2) INFORMATION FOR SEQ ID N0:119: 
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(f) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 
(B) TYPE: nucleic acfd 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:119: 
TCTTGAGGGG ACTGACTCTG 

(2) INFORMATION FOR SEQ ID NO: 120: 

(j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120: 

TGAGTGAGGA GGCAGATGTC 

(2) INFORMATION FOR SEQ ID NO:121: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:121: 

TGGCTTTGAA GAAAGAGCTG 

<2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B> TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GCAAAAGACC AGGCTGACTG 

(2) INFORMATION FOR SEQ ID NO:123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
TGCAGCTCCT TGGTCTTCTC 

(2) INFORMATION FOR SEQ ID N0:124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
GATTCACAGT CCCAAGCCTC 

(2) INFORMATION FOR SEQ ID NO: 125: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 
ATCTGGATGA GGCGGTTGAG 

(2) INFORMATION FOR SEQ ID N0:126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

GGTCACTCTC CGACGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 127: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
GGATCCAAAG TTCGTCTCTG 

(2) INFORMATION FOR SEQ ID NO: 123: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 
CGCTGTGTGT CTGATCCCTC 

(2) INFORMATION FOR SEQ ID NO: 129: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 

ATGAAGGTAA ACCCCGGGAG 

(2) INFORMATION FOR SEQ ID NO: 130: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<x?> SEQUENCE DESCRIPTION: SEQ ID N0:130: 
TGGTCTCTGG CTCTGAGCAC 

(2) INFORMATION FOR SEQ ID N0:131: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:131: 
GCCTGGAGAA GCCCAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:132: 

CACACTCTGG ACCGTTGCTG 

(2) INFORMATION FOR SEQ ID NO:133: 

O") SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 20 base pairs 
<B> TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
AAAGCTCCGC AGCCGCAGTG 

(2) INFORMATION FOR SEQ ID NO:134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
TCTTCCAGGA AGCTGCGGTC 

<2) INFORMATION FOR SEQ ID NO:135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 



GATGGTGGGG CAGCATTGAG 
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(2) INFORMATION FOR SEQ ID NO: 136: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:136: 
GTCACCAGTG GTGCCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 137: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137: 
ACCTCACGGT TGCCAACCTG 

<2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CGCAACAGCG TCTCCCTCTG 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:139: 
AGTACCTTCA TAAGTTCTTC 

(2) INFORMATION FOR SEQ ID N0:140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:140: 
TCCCAGACTT CAACCTTCAC 

(2) INFORMATION FOR SEQ ID NO:141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



AAACATCTTC CCGGTCGGAC 20 
(2) INFORMATION FOR SEQ ID N0:142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0.-142: 
GCTGAGCACC TTTACCTCAC 20 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:143: 
GACGTCCGTC CGGGAAGATG 20 
(2) INFORMATION FOR SEQ ID N0:144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 
ACACAGGAGA TGCAGGTCAC 20 
<2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO:145: 
GAGTCTTCCA TGAAGAACAG 20 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0.-146: 
GCAGTGAGGA AGGTAAGGAG 

(2) INFORMATION FOR SEQ ID N0:147: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Genomic DNA 
<ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 378... 1799 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:147: 

GGATCCAAAG GACGCCCCCG CCGACAGGAG AATTGGTTCC CGGGCCCGCG GCGATGCCCC 60 

CCCGGTAGCT CGGGCCCGTG GTCGGGTGTT TGTGAGTGTT TCTATGTGGG AGAAGGAGGA 120 

GGAGGAGGAA GAAGAAGCAA CGATTTGTCT TCTCGGCTGG TCTCCCCCCG GCTCTACATG 180 

TTCCCCGCAC TGAGGAGACG GAAGAGGAGC CGTAGCCGCC CCCCCTCCCG GCCCGGATTA 240 

TAGTCTCTCG CCACAGCGGC CTCGGCCTCC CCTTGGATTC AGACGCCGAT TCGCCCAGTG 300 

TTTGGGAAAT GGGAAGTAAT GACAGCTGGC ACCTGAACTA AGTACTTTTA TAGGCAACAC 360 

CATTCCAGAA ATTCAGG ATG AAT GGG GAT ATG CCC CAT GTC CCC ATT ACT 410 
Met Aan Gly Asp Met Pro His Val Pro lie Thr 
1 5 10 

ACT CTT GCG GGG ATT GCT AGT CTC ACA GAC CTC CTG AAC CAG CTG CCT 458 
Thr Leu Ala Gly He Ala Ser Leu Thr Asp Leu Leu Asn Gin Leu Pro 
15 20 25 

CTT CCA TCT CCT TTA CCT GCT ACA ACT ACA AAG AGC CTT CTC TTT AAT 506 
Leu Pro Ser Pro Leu Pro Ala Thr Thr Thr Lys Ser Leu Leu Phe Asn 
30 35 40 

GCA CGA ATA GCA GAA GAG GTG AAC TGC CTT TTG GCT TGT AGG GAT GAC 554 
Ala Arg lie Ala Glu Glu Val Asn Cys Leu Leu Ala Cys Arg Asp Asp 
45 50 55 

AAT TTG GTT TCA CAG CTT GTC CAT AGC CTC AAC CAG GTA TCA ACA GAT 602 
Asn Leu Val Ser Gin Leu Val His Ser Leu Asn Gin Val Ser Thr Asp 
60 65 70 75 

CAC ATA GAG TTG AAA GAT AAC CTT GGC AGT GAT GAC CCA GAA GGT GAC 650 
His He Glu Leu Lys Asp Asn Leu Gly Ser Asp Asp Pro Glu Gly Asp 
80 85 90 

ATA CCA GTC TTG TTG CAG GCC GTC CTG GCA AGG AGT CCT AAT GTT TTC 698 
lie Pro Vat Leu Leu Gin Ala Val Leu Ala Arg Ser Pro Asn Val Phe 
95 100 105 

AGG GAG AAA AGC ATG CAG AAC AGA TAT GTA CAA AGT GGA ATG ATG ATG 746 
Arg Glu Lys Ser Met Gin Asn Arg Tyr Val Gin Ser Gly Met Met Met 
110 115 120 

TCT CAG TAT AAA CTT TCT CAG AAT TCC ATG CAC AGT AGT CCT GCA TCT 794 . 
Ser Gin Tyr Lys Leu Ser Gin Asn Ser Met His Ser Ser Pro Ala Ser 
125 130 135 

TCC AAT TAT CAA CAA ACC ACT ATC TCA CAT AGC CCC TCC AGC CGG TTT 842 
Ser Asn Tyr Gin Gin Thr Thr lie Ser His Ser Pro Ser Ser Arg Phe 
140 145 150 155 

GTG CCA CCA CAG ACA AGC TCT GGG AAC AGA TTT ATG CCA CAG CAA AAT 890 
Val Pro Pro Gin Thr Ser Ser Gly Asn Arg Phe Met Pro Gin Gin Asn 
160 165 170 

AGC CCA GTG CCT AGT CCA TAC GCC CCA CAA AGC CCT GCA GGA TAC ATG 938 
Ser Pro Val Pro Ser Pro Tyr Ala Pro Gin Ser Pro Ala Gly Tyr Met 
175 180 185 

CCA TAT TCC CAT CCT TCA AGT TAC ACA ACA CAT CCA CAG ATG CAA CAA . 986 
Pro Tyr Ser His Pro Ser Ser Tyr Thr Thr His Pro Gin Met Gin Gin 
190 195 200 
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GCA TCG GTA TCA AGT CCC ATT GTT GCA GGT GGT TTG AGA AAC ATA CAT 
Ala Ser Val Ser Ser Pro He Val Ala Gly Gly Leu Arg Asn He His 
205 210 215 



1034 



GAT AAT AAA GTT TCT GGT CCG TTG TCT GGC AAT TCA GCT AAT CAT CAT 1082 
Asp Asn Lys Val Ser Gly Pro Leu Ser Gly Asn Ser Ala Asn His His 
220 225 230 235 

GCT GAT AAT CCT AGA CAT GGT TCA AGT GAG GAC TAC CTA CAC ATG GTG 1130 
Ala Asp Asn Pro Arg His Gly Ser Ser Glu Asp Tyr Leu His Met Val 
240 245 250 

CAC AGG CTA AGT AGT GAC GAT GGA GAT TCT TCA ACA ATG AGG AAT GCT 1178 
His Arg Leu Ser Ser Asp Asp Gly Asp Ser Ser Thr Met Arg Asn Ala 
255 260 265 

GCA TCT TTT CCC TTG AGA TCT CCA CAG CCA GTA TGC TCC CCT GCT GGA 1226 
Ala Ser Phe Pro Leu Arg Ser Pro Gin Pro Val Cys Ser Pro Ala Gly 
270 275 280 

AGT GAA GGA ACT CCT AAA GGC TCA AGA CCA CCT TTA ATC CTA CAA TCT 1274 
Ser Glu Gly Thr Pro Lys Gly Ser Arg Pro Pro Leu He Leu Gin Ser 
285 290 295 

CAG TCT CTA CCT TGT TCA TCA CCT CGA GAT GTT CCA CCA GAT ATC TTG 1322 
Gin Ser Leu Pro Cys Ser Ser Pro Arg Asp Val Pro Pro Asp He Leu 
300 305 310 315 

CTA GAT TCT CCA GAA AGA AAA CAA AAG AAG CAG AAG AAA ATG AAA TTA 1370 
Leu Asp Ser Pro Glu Arg Lys Gin Lys Lys Gin Lys Lys Met Lys Leu 
320 325 330 

GGC AAG GAT GAA AM GAG CAG AGT GAG AAA GCG GCA ATG TAT GAT ATA 1418 
Gly Lys Asp Glu Lys Glu Gin Ser Glu Lys Ala Ala Met Tyr Asp He 
335 340 345 

ATT AGT TCT CCA TCC AAG GAC TCT ACT AAA CTT ACA TTA AGA CTT TCT 1466 
I le Ser Ser Pro Ser Lys Asp Ser Thr Lys Leu Thr Leu Arg Leu Ser 
350 355 360 

CGT GTA AGG TCT TCA GAC ATG GAC CAG CAA GAG GAT ATG ATT TCT GGT 1514 
Arg Val Arg Ser Ser Asp Met Asp Gin Gin Glu Asp Met He Ser Gly 
365 370 375 

GTG GAA AAT AGC AAT GTT TCA GAA AAT GAT ATT CCT TTT AAT GTG CAG 1562 
Val Glu Asn Ser Asn Val Ser Glu Asn Asp lie Pro Phe Asn Val Gin 
380 385 390 395 

TAC CCA GGA CAG ACT TCA AAA ACA CCC ATT ACT CCA CAA GAT ATA AAC 1610 
Tyr Pro Gly Gin Thr Ser Lys Thr Pro lie Thr Pro Gin Asp He Asn 
400 405 410 

CGC CCA CTA AAT GCT GCT CAA TGT TTG TCG CAG CAA GAA CAA ACA GCA 1658 
Arg Pro Leu Asn Ala Ala Gin Cys Leu Ser Gin Gin Glu Gin Thr Ala 
415 420 425 

TTC CTT CCA GCA AAT CAA GTG CCT GTT TTA CAA CAG AAC ACT TCA GTT 1706 
Phe Leu Pro Ala Asn Gin Val Pro Val Leu Gin Gin Asn Thr Ser Val 
430 435 440 

GCT GCA AAA CAA CCC CAG ACC AAT AGT CAC AAA ACC TTG GTG CAG CCT 1754 
Ala Ala Lys Gin Pro Gin Thr Asn Ser His Lys Thr Leu Val Gin Pro 
445 450 455 

GGA ACA GGC ATA GAG GTC TCA GCA GAG CTG CCC AAG GAC AAG ACC TAAGA 1804 
Gly Thr Gly He Glu Val Ser Ala Glu Leu Pro Lys Asp Lys Thr 
460 465 470 

TCCAGCAGGG AACTATGTAG TCACCCCGAG AGGCCCAGCT CTCTCCGTGA GCTCTGGGCC 1864 

TAGGGTGGGG GTGGTTGTTG GTTCTGCGCG CACTGTTCCC CCTACATGAT GGGTCCATCC 1924 

CAGTTGGCTT CTCTCACTCG CTTCCTCCTG TGGAGAAGCC TGTCCAGGTG TCACTGCCTC 1984 

CAGGAAGCTG TCTCTGATTT CTCCAGTTGA ACAGTGAGAT TTGCCACACC TCACATGCAT 2044 

CGCTCTTGTC CCTGGAATTG TAACCATAGG TTTTCCTGTC TCCTGGAGGA CAAGGATGAG 2104 
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GGCTTTCCAC TTGAGTCTCC CTGGTGGAGC CCAGCTCCTG ACATACCTGG TAAAAGTTCT 2164 

CAAGAGAAGA ACATGGAGGA GGAATGTGGA TAACAACCCT GGCTGCCTGT GTGTTCCAAG 2224 

CTAGGAAGAT GTAATGTCCC CACAAACGGG GTAAATGGCT TGCCTGCGTC ACAGCTGTCT 2284 

CAAGCCCAGG CCCTGGGCGC CAGCCCAAGC CCAAGGACTA GGTCCAGAGC CACACAGCGC 2344 

CAGGCCACAT CCGCCTCACC TGGGACCCTT TGTGGGGTAC AGTCTCCGGC CCCACCCAGA 2404 

CCTCCTGAAG GAGAGACCCC ATGGCAAGGA CTCAGCCACC TGCAGTTTCA TAAGCCCCCA 2464 

GTGGGTTCCT AGGCATGAAG ACCACCGGTT AGAGGCTGAA CTGGCAGGAA CCTGTCTCCA 2524 

GCCCCTTCTC ACCCCAGCCG GGCCCTGCCT CAGAGGCAGC ACCCAGGACG TGGCCATGAC 2584 

CCGTGGACTC CACTCAATCC CTCTTCTCCA GGAGCCATGC AAAGTGTCAG CCAGCCAGGC 2644 

CCCTGGAAGG CAGTCATCAC CTCTTAAGGC ATTGTGGGTG TCGGTCCTGC AACTGCCAGG 2704 

TGCAGCACAC GACCCGTGTC CGGTGTTCGA TAGCAGGGAG CCATGACCTG GCAACGATTC 2764 

CACGCTCAAA GGGGCACCCG GGGGGCCCTG GGTCGGGGCG GATCAGCTTT CCCTGGGCAC 2824 

ATCTGCCTCA TTCCAGATCT CCAGGGCTCA TGTCTGTGAC AGGGAGGGAA GGCTCTGCCC 2884 

TGGCCTTCCG TCAGCTCTGC CAGTGCAGGC TGGGCAGCCT GGGCTTTAGA GCTGGCTTCT 2944 

GCCCACACTT TCTCCGTGAA AGGAAAACAA CTATGAGTCT GCCAAACGCA TCTCAGATGC 3004 

GTTTTAAAAA ATTCTGGTCC CCGCTCTCTG TCCCATCATC CGCCTCGGGG ACTTCCTCTC 3064 

TCCGTGGTTC TCACCCCATA CTCTGTCACT GCCACATTTT CACCTGGGCC TGGCCTTTGT 3124 

CTCCACCTGA AACTCCTGAA AATCTTGAAA TGGATTTCTA GGTCACTGGG GACTCCGGCA 3184 

GCACATTCGG CTTCAGAATA AAGGGCGCCC GCGGTCCCCC AGCACCTCCC CAAGCCACAC 3244 

CCCTAGCTTC CCTCCCTATC CCTGCAGCCT GAGGGTCCCT TCAGCCACCC TTAAGTCCCC 3304 

ACCTGGGCTC CTGCCCCGCC CCTGGCTA6C AGCGCCTTCT CCACCGGGGC CCCCTCTGCT 3364 

CACAGAGCCC CCTCACCTCC CTGGGGATGA GGGGCCAGGC CATGACCCTG AAAGTCTAGC 3424 

CCTGGCCTTG ACCTCCCAGG AGCGCCCTCC CCGCCCTCTC CCGGCCCCGG CCCCGTCCTC 3484 

TGCTGCTGGC CTCTGGGTCG TGCCCCGCAG ACTGAGCTGC GCTTGGGGGT CCTGGCGGCC 3544 

TGGGCCGTCC , CGCACCGAAC CCAGGCGGTC GGAGCCCGGC GGGAAGGCGC GAGGTCCTTC 3604 

TGGGGGCTCC TCCGACGCCT GAGGGCGCTG CTTCCCCGCG GCCGCCCCGG GTTTCTGCGG 3664 

AGCCGGGGCC TCCGCTCTCG GGTGACCCGG TGAGACCCCC GGGGAGGCCG CTGGGGAGGC 3724 

GCGGGCTCTG CTCCCGGGTC CCAAACGCAC TGGCTGCCCC TCAGGAGGGA CGGCGACCTC 3784 

CACCCACGGC GCTGGCGCCC GCACGGCCGC TCCTCCCGCT CCCGCAGCCT GGACGCCTCC 3844 

CGAGGCCGCC CCGCCGGGCC CCACGCGCGG CCCCATCCGC AGGCCAGGAC TGCCTTCCCG 3904 

GAGCTGGCGG CCCGCAGCCT GGAGGAGCCG GCCCCAGACG CCCTCCCAGC CCTCCCCAGC 3964 

CCACTCTGGC CCCGCAGCCC CCGCCTGGTC CGAGTGCGGG TCTCTGGCCC CGGCCTTTCC 4024 

CGGGGAAGGA AAGCAAAAAG CTT 4047 



(2) INFORMATION FOR SEQ, ID NO: 148: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear. 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO.-148: 



Met 


Asn 


Gly 


Asp 


Met 


Pro 


His 


Val 


Pro 


He 


Thr 


Thr Leu Ala Gly He 


1 






5 










10 




15 


Ala 


Ser 


Leu 


Thr 


Asp 


Leu 


Leu 


Asn 


Gin 


Leu 


Pro 


Leu Pro Ser Pro Leu 








20 








25 






30 


Pro 


Ala 


Thr 


Thr 


Thr 


Lys 


Ser 


Leu 


Leu 


Phe 


Asn 


Ala Arg He Ala Glu 






35 










40 








45 


Glu 


Val 


Asn 


Cys 


Leu 


Leu 


Ala 


Cys 


Arg 


Asp 


Asp 


Asn Leu Val Ser Gin 




50 










55 








60 


Leu 


Val 


His 


Ser 


Leu 


Asn 


Gin 


Val 


Ser 


Thr 


Asp 


His lie Glu Leu Lys 


65 










70 










75 


80 


Asp 


Asn 


Leu 


Gly 


Ser 


Asp 


Asp 


Pro 


Glu 


Gly 


Asp 


lie Pro Val Leu Leu 










85 










90 




95 


Gin 


Ala 


Val 


Leu 


Ala 


Arg 


Ser 


Pro 


Asn 


Val 


Phe 


Arg Glu Lys Ser Met 








100 










105 






110 


Gin 


Asn 


Arg 


Tyr 


Val 


Gin 


Ser 


Gly 


Met 


Met 


Met 


Ser Gin Tyr Lys Leu 






115 








120 








125 


Ser 


Gin 


Asn 


Ser 


Met 


His 


Ser 


Ser 


Pro 


Ala 


Ser 


Ser Asn Tyr Gin Gin 




130 










135 










140 


Thr 


Thr 


He 


Ser 


His 


Ser 


Pro 


Ser 


Ser 


Arg 


Phe 


Val Pro Pro Gin Thr 


145 










150 










155 


160 


Ser 


Ser 


Gly 


Asn 


Arg 


Phe 


Met 


Pro 


Gtn 


Gin 


Asn 


Ser Pro Val Pro Ser 










165 










170 




175 


Pro 


Tyr 


Ala 


Pro 


Gin 


Ser 


Pro 


Ala 


Gly 


Tyr 


Met 


Pro Tyr Ser His Pro 








180 










185 






190 


Ser 


Ser 


Tyr 


Thr 


Thr 


His 


Pro 


Gin 


Met 


Gin 


Gin 


Ala Ser Val Ser Ser 






195 










200 








205 
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Pro 


He Val 


Ala 


uiy 


iiiy 


Leu 


Arg 


Asn 


I le 


His 


Asp 


Asn 


Lys 


Val 


Ser 




210 




215 










220 










Gly 


Pro Leu 


Ser 


uiy 


Asn 


Ser 


Ala 


Asn 


His 


His 


Ala 


ASp 


Asn 


Pr 




C.C.J 


















C.JJ 










240 


His 


Gly Ser 


Ser 


rl ii 
ulU 


Asp 


Tyr 


Leu 


HIS 


Met 


Vat 

vai 


His 


Arg 


eu 


Ser 


Ser 








245 








250 










255 




Asp 


Asp Gly 


Asp 


Ser 


Ser 


mr 


Met 


Arg 


Asn 


ill a 

Ala 


Ala 


Ser 


Phe 


Pro 


eu 
















265 










270 






Arg 


Ser Pro 


Gin 


Pro 


Val 


Cys 


Ser 


Pro 


Ala 


lily 


Ser 


ulU 


r t w 
uiy 


i nr 


Pro 














£OU 










CO J 








Lys 


Gly Ser 


Arg 


Pro 


Pro 


Leu 


I le 


Leu 


Gin 


Ser 


Gin 


Ser 


Leu 


Pro 


Cys 




290 








295 










300 










Ser 


Ser Pro 


Arg 


Asp 


Val 


Pro 


Pro 


Asp 


I le 


Leu 


Leu 


Asp 


Ser 


Pro 


Glu 


305 






310 








315 








320 


Arg 


Lys Gin 


Lys 


Lys 


Gin 


Lys 


Lys 


Met 


Lys 


Leu 


Gly 


Lys 


Asp 


GlU 


Lys 








325 










330 










335 




Glu 


Gin Ser 


Glu 


Lys 


Ala 


Ala 


Met 


Tyr 


Asp 


I le 


I le 


Ser 


Ser 


Pro 


Ser 






340 










345 










350 






Lys 


Asp Ser 


Thr 


Lys 


Leu 


Thr 


Leu 


Arg 


Leu 


Ser 


Arg 


Val 


Arg 


Ser 


Ser 




355 










360 










365 








Asp 


Met Asp 


Gin 


Gin 


Glu 


Asp 


Met 


I le 


Ser 


Gly 


Val 


Glu 


Asn 


Ser 


Asn 


370 








375 








380 










Val 


Ser Glu 


Asn 


Asp 


I le 


Pro 


Phe 


Asn 


Val 


Gin 


Tyr 


Pro 


Gly 


Gin 


Thr 


385 






390 










395 










400 


Ser 


Lys Thr 


Pro 


lie 




Pro 


Gin 


Asp 


I le 




Arg 


Pro 


Leu 


Asn 


Ala 








405 










410 










415 




Ala 


Gin Cys 


Leu 


Ser 


Gin 


Gin 


Glu 


Gin 


Thr 


Ala 


Phe 


Leu 


Pro 


Ala 


Asn 






420 










425 










430 






Gin 


Val Pro 


Val 


Leu 


Gin 


Gin 


Asn 


Thr 


Ser 


Val 


Ala 


Ala 


Lys 


Gin 


Pro 




435 










440 










445 








Gin 


Thr Asn 


Ser 


His 


Lys 


Thr 


Leu 


Val 


Gin 


Pro 


Gly 


Thr 


Gly 


He 


Glu 




450 








455 










460 










Val 


Ser Ala 


Glu 


Leu 


Pro 


Lys 


Asp 


Lys 


Thr 














465 








470 























<2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DMA 
<ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
<B> LOCATION: 26... 799 
<D> OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

AAGCTTTTTG AATTCGGCAC GAGAT GCT ACA CAG GCT ATA TTT GAA ATA CTG 52 
Ala Thr Gin Ala He Phe Glu He Leu 
1 5 

GAG AAA TCC TGG TTG CCC CAG AAT TGT ACA CTG GTT GAT ATG AAG ATT 100 
Glu Lys Ser Trp Leu Pro Gin Asn Cys Thr Leu Val Asp Met Lys He 
10 15 20 25 

GAA TTT GGT GTT GAT GTA ACC ACC AAA GAA ATT GTT CTT GCT GAT GTT 148 
Glu Phe Gly Val Asp Val Thr Thr Lys Glu He Val Leu Ala Asp Val 
30 35 40 

ATT GAC AAT GAT TCC TGG AGA CTC TGG CCA TCA GGA GAT CGA AGC CAA 196 
He Asp Asn Asp Ser Trp Arg Leu Trp Pro Ser Gly Asp Arg Ser Gin 
45 50 55 

CAG AAA GAC AAA CAG TCT TAT CGG GAC CTC AAA GAA GTA ACT CCT GAA 244 
Gin Lys Asp Lys Gin Ser Tyr Arg Asp Leu Lys Glu Val Thr Pro Glu 
60 65 70 



GGG CTC CAA ATG GTA AAG AAA AAC TTT GAG TGG GTT GCA GAG AGA GTA 
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292 



Gty Leu Gin Met Vat Lys Lys Asn Phe Gtu Trp Val Ala Glu Arg Val 
75 80 85 



GAG TTG CTT TTG AAA TCA GAA AGT CAG TGC AGG GTT GTA GTG TTG ATG 340 
Glu Leu Leu Leu Lys Ser Glu Ser Gin Cys Arg Val Val Val Leu Met 
90 95 100 105 

GGC TCT ACT TCT GAT CTT GGT CAC TGT GAA AAA ATC AAG AAG GCC TGT 388 
Gly Ser Thr Ser Asp Leu Gly His Cys Glu Lys lie Lys Lys Ala Cys 
110 115 120 

GGA AAT TTT GGC ATT CCA TGT GAA CTT CGA GTA ACA TCT GCG CAT AAA 436 
Gly Asn Phe Gly lie Pro Cys Glu Leu Arg Val Thr Ser Ala His Lys 
125 130 135 

GGA CCA GAT GAA ACT CTG AGG ATT AAA GCT GAG TAT GAA GGG GAT GGC 484 
Gly Pro Asp Glu Thr Leu Arg lie Lys Ala Glu Tyr Glu Gly Asp Gly 
140 145 150 

ATT CCT ACT GTA TTT GTG GCA GTG GCA GGC AGA AGT AAT GGT TTG GGA 532 
lie Pro Thr Val Phe Val Ala Val Ala Gly Arg Ser Asn Gly Leu Gly 
155 160 165 

CCA GTG ATG TCT GGG AAC ACT GCA TAT CCA GTT ATC AGC TGT CCT CCC 580 
Pro Val Met Ser Gly Asn Thr Ala Tyr Pro Val He Ser Cys Pro Pro 
170 175 180 185 

CTC ACA CCA GAC TGG GGA GTT CAG GAT GTG TGG TCT TCT CTT CGA CTA 628 
Leu Thr Pro Asp Trp Gly Val Gin Asp Val Trp Ser Ser Leu Arg Leu 
190 195 200 

CCC AGT GGT CTT GGC TGT TCA ACC GTA CTT TCT CCA GAA GGA TCA GCT 676 
Pro Ser Gly Leu Gly Cys Ser Thr Val Leu Ser Pro Glu Gly Ser Ata 
205 210 215 

CAA TTT GCT GCT CAG ATA TTT GGG TTA AGC AAC CAT TTG GTA TGG AGC 724 
Gin Phe Ala Ala Gin lie Phe Gly Leu Ser Asn His Leu Val Trp Ser 
220 225 230 

AAA CTG CGA GCA AGC ATT TTG AAC ACA TGG ATT TCC TTG AAG CAG GCT 772 
Lys Leu Arg Ala Ser He Leu Asn Thr Trp lie Ser Leu Lys Gin Ala 
235 240 245 

GAC AAG AAA ATC AGA GAA TGT AAT TTA TAAGAAAGAA TGCCATTGAA TTTTTTA 826 
Asp Lys Lys He Arg Glu Cys Asn Leu 
250 255 

GGGGAAAAAC TACAAATTTC TAATTTAGCT GAAGGAAAAT CAAGCAAGAT GAAAAGGTAA 886 

TTTTAAATTA GAGAACACAA ATAAAATGTA TTAGTGAATA AATGGTGAGG GTAGGCCTAT 946 

TCAGATGCAA GGCCAGCAAT GGGGCTCCCC ATTATCCCCA CCCCTTTGGT CCCAGTCCCC 1006 

TTCTCTGCAA TGGGCACGCA TAGAGGAGAG ACAAAGGGTA TTAGACGCAA CATCATTGGC 1066 

CCAGGGGAGT CCGAGAAGAG CTGCCATTGG CTGACAGGGC ATTTTCAGGC TCTGTCATTG 1126 

GTCAGGGAGC ACACCCCAGC CTGAAGAGTG ATGCCATTGG CCAGGGAGTG GTTTTGTCAT 1186 

AGCCGTTGGC TGTGAAGTGG AAGGAAAAGA TCTGGGAATG AAGCCCTGTG GCCAGGAAGA 1246. 

TAGACAGGGC AGCAACTTCT GGGCCTCCAG GCCCTCTTCC CACCATAGCA ATGTGGGCAA 1306 

AACTGGTGTC AGGCCCCAGC CAGAAAAAGG AGCCCAAGCC AGAGGGCAAG TGACAAAGGA 1366 

TGTACCATGT CCAATCTCCC ACACCCTGGG GCTGCCCTTC CCAATGTCTT TCTTGATAGC 1426 

CAAGTTGGGC TGGGAGCAGC TCACTGCTCC TCTAGCCAGG AGGGTTTCTC AGCTCCTGGA 1486 

GGCCGCAGCT TGATGTTGAA CTGCTGCAGG GTCTGCTCCA GCTGTTTCTG GTTCCCAGCA 1546 

AAGTAGGCGG ACACAGCATT GTGGAAGAGC AGCAGCTGCT TGTGCATCAC CTTGATCTTG 1606 

TTTTCTTCCA GGAACTTGAG CTTGATGGCC ACATCTCCCC GCAGCTTCTC ATACTTGTCC 1666 

CGATGGGCCT GGAAAGTGGC CTGGGCACTC TCAAGTCGAC CACGTGTCCC TGCATCCCGG 1726 

GGGCCTAGAC TCAGCTCCTC TAAGTCTGTT CGGTAGGCAT CATATTCCAG CCTGGCAGCC 1786 

TCATACTGTT TCACAGTCAT GAGCGTGTCT TCCATGGTCT TGGTGACCAA TGTGTTGATG 1846 

CTAGAGACAA AGAAGTTCAC GGCTCCTAGC AGCGTTTCCC CATTCTTGCA TAGTAGTTTC 1906 

TGTGTCTCTG CATTGTAGCC AAATTCCTCC TGAAGCTCTG GGGACTTCTG GCTGAGGTCA 1966 

GCAAAGGCAT CACCCAGTGC ATGCTGGGTC TGCAGCAGGC TGTAGAGGTG GGCTGTCAGT 2026 

GCCCGGCCCA GCTGCAGGAC ACTCTCATAC TTGCGCTTCG TCTCACGCAG CAACTCAATC 2086 

TGCAGCTCTA GCTCCAGGAT TCCGGCGCCT CCACTCCGTC CCCCGCGGGT CTGCTCTGTG 2146 

TGCCATGGAC GGCATTGTCC CAGATATAGC CGTTGGTACA AAGCGGGGAT CTGACGAGCT 2206 

TTTCTCTACT TGTGTCACTA ACGGACCGTT TATCATGAGC AGCAACTCGG CTTCTGCAGC 2266 

AAACGGAAAT GACAGCAAGA AGTTCAAAGG TGACAGCCGA AGTGCAGGCG TCCCCTCTAG 2326 

AGTGATCCAC ATCCGGAAGC TCCCCATCGA CGTCACGGAG GGGGAAGTCA TCTCCCTGGG 2386 
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GCTGCCCTTT GGGAAGGTCA CCAACCTCCT GATGCTGAAG GGGAAAAACC AGGCCTTCAT 2446 

CGAGATGAAC ACGGAGGAGG CTGCCAATAC CATGGTGAAC TACTACACCT CGGTGACCCC 2506 

TGTGCTGCGC GGCCAGCCCA TCTACATCCA GTTCTCCAAC CACAAGGAGC TGAAGACCGA 2566 

CAGCTCTCCC AACCAGGCGC GGGCCCAGGC GGCCCTGCAG GCGGTGAACT CGGTCCAGTC 2626 

GGGGAACCTG GCCTTGGCTG CCTCGGCGGC GGCCGTGGAT GCAGGGATGG CGATGGCCGG 2686 

GCAGAGCCCC GTGCTCAGGA TCATCGTGGA GAACCTCTTC TACCCTGTGA CCCTGGATGT 2746 

GCTGCACCAG ATTTTCTCCA AGTTCGGCAC AGTGTTGAAG ATCATCACCT TCACCAAGAA 2806 

CAACCAGTTC CAGGCCCTGC TGCAGTATGC GGACCCCGTG AGCGCCCAGC ACGCCAAGCT 2866 

GTCGCTGGAC GGGCAGAACA TCTACAACGC CTGCTGCACG CTGCGCATCG ACTTTTCCAA 2926 

GCTCACCAGC CTCAACGTCA AGTACAACAA TGACAAGAGC CGTGACTACC TCGTGCCGAA 2986 

TTCTTTGGAT CC 2998 

<2) INFORMATION FOR SEQ ID NO:150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





<xi) SEQUENCE 


DESCRIPTION 


SEQ ID 


NO:150: 


Ala 


Thr Gin 


Ala 


He 


Phe 


Glu 


lie 


Leu 


GlU 


Lys Ser Trp Leu Pro Gin 


1 






5 










10 


15 


Asn 


Cys Thr 


Leu 


Val 


Asp 


Met 


Lys 


He 


Glu 


Phe Gly Val Asp Val Thr 






20 










25 




30 


Thr 


Lys Glu 


He 


Val 


Leu 


Ala 


Asp 


Val 


He 


Asp Asn Asp Ser Trp Arg 




35 










40 






45 


Leu 


Trp Pro 


Ser 


Gly 


Asp 


Arg 


Ser 


Gin 


Gin 


Lys Asp Lys Gin Ser Tyr 




50 








55 








60 


Arg 


Asp Leu 


Lys 


Glu 


Val 


Thr 


Pro 


Glu 


Gly 


Leu Gin Met Val Lys Lys 


65 






70 










75 80 


Asn 


Phe Glu 


Trp 


Val 


Ala 


Glu 


Arg 


Val 


Glu 


Leu Leu Leu Lys Ser Glu 






85 










90 


95 


Ser 


Gin Cys 


Arg 


Val 


Val 


Val 


Leu 


Met 


Gly 


Ser Thr Ser Asp Leu Gly 




100 










105 




110 


His 


Cys Glu 


Lys 


He 


Lys 


Lys 


Ala 


Cys 


Gly 


Asn Phe Gly He Pro Cys 




115 










120 






125 


Glu 


Leu Arg 


Val 


Thr 


Ser 


Ala 


His 


Lys 


Gly 


Pro Asp Glu Thr Leu Arg 




130 








135 








140 


He 


Lys Ala 


Glu 


Tyr 


Glu 


Gly 


Asp 


Gly 


He 


Pro Thr Val Phe Val Ala 


145 








150 








155 160 


Val 


Ala Gly 


Arg 


Ser 


Asn 


Gly 


Leu 


Gly 


Pro 


Val Met Ser Gly Asn Thr 








165 










170 


175 


Ala 


Tyr Pro 


Val 


He 


Ser 


Cys 


Pro 


Pro 


Leu 


Thr Pro Asp Trp Gly Val 






180 










185 




190 


Gin 


Asp Val 


Trp 


Ser 


Ser 


Leu 


Arg 


Leu 


Pro 


Ser Gly Leu Gly Cys Ser 




195 








200 






205 


Thr 


Val Leu 


Ser 


Pro 


Glu 


Gly 


Ser 


Ala 


Gin 


Phe Ala Ala Gin He Phe 




210 








215 








220 


Gly 


Leu Ser 


Asn 


His 


Leu 


Val 


Trp 


Ser 


Lys 


Leu Arg Ala Ser He Leu 


225 








230 








235 240 


Asn 


Thr Trp 


He 


Ser 


Leu 


Lys 


Gin 


Ala 


Asp 


Lys Lys He Arg Glu Cys 








245 










250 


255 



Asn Leu 



(2) INFORMATION FOR SEQ ID NO:151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID N0:151: 

He Gin Arg Phe Gly Thr Ser Gly His He Met Asn Leu Gin Ala Gin 
15 10 15 



— 86 — 



Pro Lys 

Pro Ala 

I le Arg 
50 

Ala Val 
65 

Ala Leu 

Met Leu 

Ala Pro 

Ser Ser 
130 
Asn Cys 
145 

Pro Gly 

Arg Glu 

Met Pro 

Asn Ser 
210 
Gin Pro 
225 

Arg Gin 

Lys Gin 

Ala Ala 

Gin Gin 
290 
Pro Leu 
305 

Phe Pro 

Gin Asp 

Pro Arg 

Ala Leu 
370 
Leu Phe 
385 

Gly Gin 
Ser Gin 
Arg Glu 



Ala Gin Asn 
20 

Pro Lys Glu 
35 

Val Lys Glu 



Gin Pro Pro 
40 

Glu Gin Tyr 
55 

Ser Thr Ser Gin Pro Val 
70 

Val Val Tyr 



Lys Arg Lys Arg Cys Leu Phe 
25 

Pro Leu Gin Pro 



Leu Asn Ser 
85 

Ser Gin Gin 

100 
Gly Arg Gly 
115 

Trp Gin Gin 



Leu Gly His Glu 
60 

Glu Leu Pro Pro 
75 

Gly Pro Glu Arg 
90 

Val Ala Ser Val Lys Trp Pro 
105 

Gly Gly Gly Gly 



Pro Glu Arg 
120 

Gin Pro Gly 
135 

His Ser Leu Ser Leu Tyr 
150 

Pro Thr Tyr 



Val Gly Val 
165 

Lys Ala Gly 

180 
Gin Lys Val 
195 

Phe His Ala 



Gin Leu Glu 
200 

Ala Lys Lys 
215 

Phe Gin Leu Ala Phe Gly 
230 

Pro Pro Asn 



Gin Pro Pro Pro 
140 

Ser Ala Thr Lys 
155 

Tyr Asn His Pro 
170 

Gly Pro Gin Leu Asp Arg Tyr 
185 

Val Gly Arg Pro 



Gly Pro Pro 
245 

Gin Gin Gin 

260 
Leu Pro Gin 
275 

Pro Ser Gin 



Pro Pro Asn Gin 
220 

His Gin Val Asn 
235 

Pro Val Ala Ala 
250 

Gin Gin Pro Gin Gin Gin Gin 
265 

Phe Glu Asn Phe 



Met Pro Leu 
280 

Gin Pro Gin 
295 

Gly Gin Ser His Leu Ala 
310 

Asp Met Asn 



Pro Asn Pro 
325 

Ser Ala Pro 

340 
Arg Ser Arg 
355 

Asp Gly Ala 



Arg Leu Ser 
360 

Gly Thr Gin 
375 

Leu His His Trp Pro Leu 
390 

Glu Ala Leu 



Asp Phe Gly Leu 
300 

His His Ser Met 
315 

Pro Glu Leu Arg 
330 

Gin Pro Ala Leu Pro Gin Val 
345 

Lys Glu Gly lie 



Pro His Pro 
405 

Leu Leu Pro 

420 
Ala Pro Ala 
435 

Asp Cys Gly 



Pro Gly Gin Glu 
380 

Gin Gin Pro Pro 
395 

Gly Phe Pro Leu 
410 

Asp Gly Glu Arg Leu Ala Pro 
425 

Glu Glu Gly Met 



Thr Gly 
450 

Arg Arg Arg Arg Arg 
465 

Gin Lys 



Met Gly Ser 
440 

Gin Val Leu 

455 
Ala Ser Gin 
470 

Leu Ala Ser 



Ala Val Glu 
485 

Gly Ser Glu Glu Lys Arg Lys Ser 
500 

Gly Val Glu Phe Ser 
515 

Gly Met Val 



Asp Ser 
530 

Thr Val Asp Pro Thr 
545 

Gly Lys 



Val Thr 
Gin Ala 



Gly Leu Glu 
565 

Arg Arg Arg 

580 
Glu Asp Met 



Glu Pro Ser 
520 

Pro Leu lie 

535 
Glu Ala Ala 
550 

Gin Asn Pro 
Ser Thr Arg 
Asn Val Lys 



Arg Gly Gly Val 
460 

Glu Ala Asn Leu 
475 

Leu Gin Asn Ala 
490 

Val Leu Ala Ser 
505 

Leu Ala Thr Lys 

lie Pro Val Ser 
540 

Gin Ala Gly Gly 
555 

Ala Glu His Lys 
570 

lie Pro Gly Thr 
585 

Leu Glu Gly Glu 
— 87 — 



Gly Gly 

30 
Pro Gin 
45 

Gly Pro 

Pro Ser 

Thr Ser 

Asn Ser 
110 
Gly Val 
125 

His Ser 

Gly Ser 

Glu Ala 

Val Arg 
190 
Gin Ala 
205 

Ser Leu 

Arg Gin 

Phe Pro 

Gin Gin 
270 
Tyr Ser 
285 

Gin Pro 

Ala Pro 

Lys Ala 

Gin lie 
350 
Leu Pro 
365 

Ala Thr 

Pro Gly 

Glu Leu 

Asn Gly 
430 
Arg Ala 
445 

He Gin 



Gin Glu 

Gin Ser 

Gly Gly 

Ser Leu 

80 
Ala Ala 
95 

Val Met 

Ser Asp 

Thr Trp 

Pro His 
160 
Leu Lys 
175 

Pro Met 

Pro Leu 

Pro Leu 

Val Phe 
240 
Pro Gin 
255 

Gin Gin 

Met Pro 

Ala Gly 

Tyr Pro 
320 
Leu Leu 
335 

Pro Phe 

Pro Ser 

Gly Asn 

Ser Leu 
400 
Arg Glu 
415 

Arg Glu 
Val Ser 
Ser Thr 



Leu Thr 

Lys Asp 

Thr Thr 
510 
Arg Ala 
525 

Val Pro 
Leu Asp 
Pro Ser 



Leu Ala 
480 
Gly Ser 
495 

Lys Cys 

Arg Glu 

Val Arg 

Glu Asp 
560 
Val He 
575 

Gin Ala 



Asp Ala i 
590 

Pro Ser Val Arg 



595 600 605 



Lys 


Pro 


Lys 


Gin 


Arg 


Pro 


Arg 


Pro 


Glu 


Pro 


Leu 


He 


He Pro 


Thr 


Lys 


610 








615 










620 








Ala 


Gly 


Thr 


Phe 


He 


Ala 


Pro 


Pro 


Val 


Tyr 


Ser 


Asn 


He Thr 


Pro 


Tyr 


625 










630 










635 








640 


Gin 


Ser 


His 


Leu 


Arg 


Ser 


Pro 


Val 


Arg 


Leu 


Ala 


Asp 


His Pro 


Ser 


Glu 










645 










650 








655 




Arg 


Ser 


Phe 


Glu 


Leu 


Pro 


Pro 


Tyr 


Thr 


Pro 


Pro 


Pro 


He Leu 


Ser 


Pro 








660 










665 








670 






Val 


Arg 


Glu 


Gly 


Ser 


Gly 


Leu 


Tyr 


Phe 


Asn 


Ala 


He 


He Ser 


Thr 


Ser 






675 










680 










685 






Thr 


He 


Pro 


Ala 


Pro 


Pro 


Pro 


He 


Thr 


Pro 


Lys 


Ser 


Ala His 


Arg 


Thr 




690 










695 








700 








Leu 


Leu 


Arg 


Thr 


Asn 


Ser 


Ala 


Glu 


Val 


Thr 


Pro 


Pro 


Val Leu 


Ser 


Val 


705 










710 










715 








720 


Met 


Gly 


Glu 


Ala 


Thr 


Pro 


Val 


Ser 


He 


Glu 


Pro 


Arg 


He Asn 


Val 


Gly 










725 










730 








735 




Ser 


Arg 


Phe 


Gin 


Ala 


Glu 


lie 


Pro 


Leu 


Met 


Arg 


Asp 


Arg Ala 


Leu 


Ala 








740 










745 






750 






Ala 


Ala 


Asp 


Pro 


His 


Lys 


Ala 


Asp 


Leu 


Val 


Trp 


Gin 


Pro Trp 


Glu 


Asp 






755 










760 










765 






Leu 


Glu 


Ser 


Ser 


Arg 


Glu 


Lys 


Gin 


Arg 


Gin 


Val 


Glu 


Asp Leu 


Leu 


Thr 




770 










775 










780 






Ala 


Ala 


Cys 


Ser 


Ser 


He 


Phe 


Pro 


Gly 


Ala 


Gly 


Thr 


Asn Gin 


Glu 


Leu 


785 








790 








795 








800 


Ala 


Leu 


His 


Cys 


Leu 


His 


Glu 


Ser 


Arg 


Gly 


Asp 


He 


Leu Glu 


Thr 


Leu 










805 










810 






815 




Asn 


Lys 


Leu 


Leu 


Leu 


Lys 


Lys 


Pro 


Leu 


Arg 


Pro 


His 


Asn His 


Pro 


Leu 








820 






825 








830 






Ala 


Thr 


Tyr 


His 


Tyr 


Thr 


Gly 


Ser 


Asp 


Gin 


Trp 


Lys 


Met Ala 


Glu 


Arg 






835 










840 








845 






Lys 


Leu 


Phe 


Asn 


Lys 


Gly 


He 


Ala 


He 


Tyr 


Lys 


Lys 


Asp Phe 


Phe 


Leu 




850 










855 










860 








Val 


Gin 


Lys 


Leu 


He 


Gin 


Thr 


Lys 


Thr 


Val 


Ala 


Gin 


Cys Val 


Glu 


Phe 


865 








870 








875 








880 


Tyr 


Tyr 


Thr 


Tyr 


Lys 


Lys 


Gin 


Val 


Lys 


He 


Gly 


Arg 


Asn Gly 


Thr 


Leu 










885 










890 








895 




Thr 


Phe 


Gly 


Asp 


Val 


Asp 


Thr 


Ser 


Asp 


Glu 


Lys 


Ser 


Ala Gin 


Glu 


Glu 








900 










905 








910 






Val 


Glu 


Val 


Asp 


He 


Lys 


Thr 


Ser 


Gin 


Lys 


Phe 


Pro 


Arg Val 


Pro 


Leu 






915 






920 








925 






Pro 


Arg 


Arg 


Glu 


Ser 


Pro 


Ser 


Glu 


Glu 


Arg 


Leu 


Glu 


Pro Lys 


Arg 


Glu 




930 










935 










940 








Val 


Lys 


Glu 


Pro 


Arg 


Lys 


Glu 


Gly 


Glu 


Glu 


Glu 


Val 


Pro Glu 


He 


Gin 


945 






950 








955 








960 


Glu 


Lys 


Glu 


Glu 


Gin 


Glu 


Glu 


Gly 


Arg 


Glu 


Arg 


Ser 


Arg Arg 


Ala 


Ala 










965 










970 








975 




Ala 


Val 


Lys 


Ala 


Thr 


Gin 


Thr 


Leu 


Gin 


Ala 


Asn 


Glu 


Ser Ala 


Ser 


Asp 








980 










985 








990 






He 


Leu 


He 


Leu 


Arg 


Ser 


His 


Glu 


Ser 


Asn 


Ala 


Pro 


Gly Ser 


Ala 


Gly 






995 










1000 








1005 






Gly 


Gin 


Ala 


Ser 


Glu 


Lys 


Pro 


Arg Glu 


Gly 


Thr 


Gly 


Lys Ser 


Arg 


Arg 




1010 








1015 








1020 






Ala 


Leu Pro 


Phe 


Ser 


Glu 


Lys Lys Lys 


Lys 


Lys 


Gin Lys Ala 






1025 








1030 








1035 









(2) INFORMATION FOR SEQ ID NO:152: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 849 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

He Arg His Glu Val Ser Phe Leu Trp Asn Thr Glu Ala Ala Cys Pro 

15 10 15 

lie Gin Thr Thr Thr Asp Thr Asp Gin Ala Cys Ser He Arg Asp Pro 

20 25 30 

Asn Ser Gly Phe Val Phe Asn Leu Asn Pro Leu Asn Ser Ser Gin Gly 
35 40 45 

— 88 — 



lyr 




Val 


Ser 


biy 


I le 


uiy 




I le 


Phe 


Met 


Phe 


Asn 


Val 


Cys 


uiy 




50 










55 










60 










Thr 


Met 


Pro 


Val 


Cys 


tiiy 


Thr 


I le 


Leu 


Gly 


Lys 


Pro 


Ala 


Ser 


tiiy 




65 










70 










75 










80 


rl it 
ulU 


Ala 


Glu 


inr 


lain 


Thr 


rln 
ulU 


ulU 


Leu 


Lys 


Asn 


Trp 


Lys 


Pro 


A f a 

Aia 


Arg 










85 










on 










0<^ 
yj 




Pro 


Val 


Gly 


I le 


Glu 


Lys 


Ser 


Leu 


Gin 


Leu 


Ser 


u 

Thr 


Glu 


Gly 


Phe 


I le 






100 




















i in 






Thr 


Leu 


Thr 


Tyr 


Lys 


Gly 


Pro 


Leu 


Ser 


Ala 


Lys 


Gly 


1 nr 


A 1 a 

Aia 


Asp 


Ala 






115 










120 










125 








Phe 


lie 


Val 


Arg 


Phe 


Val 


cys 


Asn 


Asp 


flsp 


Val 


T 

Tyr 


Ser 


r.l « 


Pro 


Leu 




130 










135 










140 










Lys 


rne 


Leu 


HIS 


rl 

n 


Asp 


He 


sp 


er 


y 


Gin 


Gly 


I le 


Arg 


sn 


Thr 


145 










150 










155 










160 


Tyr 


oka 

rne 


Glu 


rne 


. 

GlU 


Thr 


Ala 


Leu 


Ala 


cys 


Val 


Pro 


Ser 


Pro 


Val 


Asp 










too 










17fl 










175 


Cys 


Gin 


val 


Till* 


Asp 


Leu 


Ala 


laiy 


Asn 


rln 
ulU 


Tyr 


Asp 


Leu 


i nr 


Gly 


Leu 




























190 






Ser 


Thr 


Val 


Arg 


Lys 


Pro 


Trp 


Thr 


Ala 


Val 


Asp 


Thr 


Ser 


Val 


Asp 


Gly 






195 










200 










205 








Arg 


Lys 


Arg 


Thr 


Phe 


Tyr 


Leu 


Ser 


Val 


Cys 


Asn 


Pro 


Leu 


Pro 


Tyr 


He 




210 










215 










220 










Pro 


Gly 


Cys 


Gin 


Gly 


Ser 


Ala 


Val 


Gly 


Ser 


Cys 


Leu 


Val 


Ser 


Glu 


Gly 


225 










230 










235 










240 


Asn 


Ser 


Trp 


Asn 


Leu 


Gly 


Val 


Val 


Gin 


Met 


Ser 


Pro 


Gin 


Ala 


Ala 


Ala 










245 
























Asn 


Gly 


Ser 


Leu 


Ser 


I le 


Met 


Tyr 


Val 


Asn 


Gly 


Asp 


Lys 


Cys 


Gly 


Asn 








COU 










OAK 
COD 










270 






Gin 


Arg 


Phe 


Ser 


Thr 


Arg 


I le 


Thr 


Phe 


Glu 


Cys 


Ala 


Gin 


I le 


Ser 


Gly 
















280 
















Ser 


Pro 


Ala 


Phe 


Gin 


Leu 


Gin 


Asp 


Gly 


Cys 


Glu 


Tyr 


Val 


Phe 


I le 


Trp 




290 










295 




300 








Arg 


Thr 


Val 


Glu 


Ala 


Cys 


Pro 


Val 


Val 


Arg 


Val 


Glu 


Gly 


Asp 


Asn 


Cys 


305 










310 










315 










320 


Glu 


Val 


Lys 


Asp 


Pro 


Arg 


His 


Gly 


Asn 


Leu 


Tyr 


Asp 


Leu 


Lys 


Pro 


Leu 










325 










330 










335 




Gly 


Leu 


Asn 


Asp 


Thr 


He 


Val 


Ser 


Ala 


Gly 


Glu 


Tyr 


Thr 


Tyr 


Tyr 


Phe 








340 










345 










350 






Arg 


Val 


Cys 


Gly 


Lys 


Leu 


Ser 


Ser 


Asp 


Val 


Cys 


Pro 


Thr 


Ser 


Asp 


Lys 






355 










360 










365 








Ser 


Lys 


Val 


Val 


Ser 


Ser 


Cys 


Gin 


Glu 


Lys 


Arg 


Glu 


Pro 


Gin 


Gly 


Phe 




370 










375 










380 










His 


Lys 


Val 


Ala 


Gly 


Leu 


Leu 


Thr 


Gin 


Lys 


Leu 


Thr 


Tyr 


Glu 


Asn 


Gly 


385 






390 










395 










400 


Leu 


Leu 


Lys 


Met 


Asn 


Phe 


Thr 


Gly 


Gly 


Asp 


Thr 


Cys 


His 


Lys 


Val 


Tyr 










405 










410 










415 




Gin 


Arg 


Ser 


Thr 


Ala 


He 


Phe 


Phe 


Tyr 


Cys 


Asp 


Arg 


Gly 


Thr 


Gin 


Arg 








420 










425 










430 






Pro 


Val 


Phe 


Leu 


Lys 


Glu 


Thr 


Ser 


Asp 


Cys 


Ser 


Tyr 


Leu 


Phe 


Glu 


Trp 






435 










440 










445 








Arg 


Thr 


Gin 


Tyr 


Ala 


Cys 


Pro 


Pro 


Phe 


Asp 


Leu 


Thr 


Glu 


Cys 


Ser 


Phe 




450 










455 










460 










Lys 


Asp 


Gly 


Ala 


Gly 


Asn 


Ser 


Phe 


Asp 


Leu 


Ser 


Ser 


Leu 


Ser 


Arg 


Tyr 


465 










470 










475 










480 


Ser 


Asp 


Asn 


Trp 


Glu 


Ala 


I le 


Thr 


Gly 


Thr 


Gly 


Asp 


Pro 


Glu 


His 


Tyr 










485 










490 










495 




Leu 


He 


Asn 


Val 


Cys 


Lys 


Ser 


Leu 


Ala 


Pro 


Gin 


Ala 


Gly 


Thr 


Glu 


Pro 








500 










505 










510 






Cys 


Pro 


Pro 


Glu 


Ala 


Ala 


Ala 


Cys 


Leu 


Leu 


Gly 


Gly 


Ser 


Lys 


Pro 


Val 






515 










520 










525 








Asn 


Leu 


Gly 


Arg 


Val 


Arg 


Asp 


Gly 


Pro 


Gin 


Trp 


Arg 


Asp 


Gly 


lie 


I le 




530 










535 










540 










Val 


Leu 


Lys 


Tyr 


Val 


Asp 


Gly 


Asp 


Leu 


Cys 


Pro 


Asp 


Gly 


lie 


Arg 


Lys 


545 










550 










555 










560 


Lys 


Ser 


Thr 


Thr 


He 


Arg 


Phe 


Thr 


Cys 


Ser 


Glu 


Ser 


Gin 


Val 


Asn 


Ser 










565 










570 










575 




Arg 


Pro 


Met 


Phe 


He 


Ser 


Ala 


Val 


Glu 


Asp 


Cys 


Glu 


Tyr 


Thr 


Phe 


Ala 








580 










585 








590 






Trp 


Pro 


Thr 


Ala 


Thr 


Ala 


Cys 


Pro 


Met 


Lys 


Ser 


Asn 


Glu 


His 


Asp 


Asp 






595 










600 










605 








Cys 


Gin 


Val 


Thr 


Asn 


Pro 


Ser 


Thr 


Gly 


His 


Leu 


Phe 


Asp 


Leu 


Ser 


Ser 




610 










615 










620 










Leu 


Ser 


Gly 


Arg 


Ala 


Gly 


Phe 


Thr 


Ala 


Ala 


Tyr 


Ser 


Glu 


Lys 


Gly 


Leu 
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625 










630 










635 


640 


Val 


Tyr 


Met 


Ser 


lie 


Cys 


Gly 


Glu 


Asn 


Glu 


Asn Cys 


Pro Pro Gly Val 








645 










650 




655 


Gly 


Ala 


Cys 


Phe 


Gly 


Gin 


Thr 


Arg 


I le 


Ser 


Val Gly 


Lys Ala Asn Lys 








660 










665 






670 


Arg 


Leu 


Arg 


Tyr 


Val 


Asp 


Gin 


Vat 


Leu 


Gin 


Leu Val 


Tyr Lys Asp Gly 






675 










680 








685 


Ser 


Pro 


Cys 


Pro 


Ser 


Lys 


Ser 


Gly 


Leu 


Ser 


Tyr Lys 


w«» i fi« rei- 
ser vai lie ser 




690 








695 








700 




Phe 


Val 


Cys 


Arg 


Pro 


Glu 


Ala 


Gly 


Pro 


Thr 


Asn Arg 


Pro Met Leu I le 


705 








710 








715 


720 


Ser 


Leu 


Asp 


Lys 


Gin 


Thr 


Cys 


Thr 


Leu 


Phe 


Phe Ser 


Trp His Thr Pro 








725 










730 




735 


Leu 


Ala 


Cys 


Glu 


Gin 


Ala 


Thr 


Glu 


Cys 


Ser 


Val Arg 


Asn Gly Ser Ser 






740 










745 






750 


He 


Val 


Asp 


Leu 


Ser 


Pro 


Leu 


lie 


His 


Arg 


Thr Gly 


Gly Tyr Glu Ala 






755 










760 








765 


Tyr 


Asp 


Glu 


Ser 


Glu 


Asp 


Asp 


Ala 


Ser 


Asp 


Thr Asn 


Pro Asp Phe Tyr 




770 










775 








780 




He 


Asn 


Ue 


Cys 


Gin 


Pro 


Leu 


Asn 


Pro 


Met 


His Gly 


Val Pro Cys Pro 


785 








790 










795 


800 


Ala 


Gly 


Ala 


Ala 


Val 


Cys 


Lys 


Val 


Pro 


He 


Asp Gly 


Pro Pro lie Asp 










805 










810 




815 


He 


Gly 


Arg 


Val 


Ala 


Gly 


Pro 


Pro 


He 


Leu 


Asn Pro 


He Ala Asn Glu 








820 










825 






830 


He 


Tyr 


Leu 


Asn 


Phe 


Glu 


Ser 


Ser 


Thr 


Pro 


Cys Gin 


Glu Phe Ser Cys 




835 










840 








845 



Lys 



(2) INFORMATION FOR SEQ ID NO: 153: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 852 amino acids 
<B> TYPE: amino acid 
(C> STRANDED NESS: single 
(D) TOPOLOGY: linear 





(xi) SEQUENCE 


DESCRIPTION 


Met 


Ala 


Arg 


Leu 


Ser 


Arg 


Pro 


Glu 


1 








5 








Glu 


Asp 


Leu 


Pro 


Tyr 


Glu 


Glu 


Glu 






20 










Lys 


Cys 


Trp 


Leu 


His 


Tyr 


He 


Glu 






35 










40 


Arg 


Leu 


Asn 


Gin 


Leu 


Tyr 


Glu 


Arg 




50 










55 




Tyr 


Lys 


Leu 


Trp 


Tyr 


Arg 


Tyr 


Leu 


65 










70 






His 


Arg 


Cys 


Val 


Thr 


Asp 


Pro 


Ala 










85 








Glu 


Arg 


Ala 


Phe 


Val 


Phe 


Met 


His 








100 










Tyr 


Cys 


Gin 


Phe 


Leu 


Met 


Asp 


Gin 






115 










120 


Thr 


Phe 


Asp 


Arg 


Ala 


Leu 


Arg 


Ala 




130 








135 




He 


Trp 


Pro 


Leu 


Tyr 


Leu 


Arg 


Phe 


145 










150 






Thr 


Ala 


Val 


Arg 


Gly 


Tyr 


Arg 


Arg 










165 








Ala 


Glu 


Glu 


Tyr 


He 


Glu 


Tyr 


Leu 








180 










Ala 


Ala 


Gin 


Arg 


Leu 


Ala 


Thr 


Val 






195 










200 


Lys 


Ala 


Gly 


Lys 


Ser 


Asn 


Tyr 


Gin 




210 










215 




He 


Ser 


Gin 


Asn 


Pro 


Asp 


Lys 


Val 


225 










230 






He 


Arg 


Gly 


Gly 


Leu 


Thr 


Arg 


Phe 



245 



SEQ ID 


NO:153: 




Arg 


Pro 


Asp 


Leu 


Val Phe Glu Glu 




10 






15 


He 


Met 


Arg 


Asn 


Gin Phe Ser Val 


25 








30 


Phe 


Lys 


Gin 


Gly 


Ala Pro Lys Pro 








45 


Ala 


Leu 


Lys 


Leu 


Leu Pro Cys Ser 








60 




Lys 


Ala 


Arg 


Arg 


Ala Gin Val Lys 






75 




80 


Tyr 


Glu 


Asp 


Val 


Asn Asn Cys His 




90 






95 


Lys 


Met 


Pro 


Arg 


Leu Trp Leu Asp 


105 








110 


Gly 


Arg 


Val 


Thr 


His Thr Arg Arg 










125 


Leu 


Pro 


He 


Thr 


Gtn His Ser Arg 








140 




Leu 


Arg 


Ser 


His 


Pro Leu Pro Glu 






155 




160 


Phe 


Leu 


Lys 


Leu 


Ser Pro Glu Ser 




170 






175 


Lys 


Ser 


Ser 


Asp 


Arg Leu Asp Glu 


185 








190 


Val 


Asn 


Asp 


Glu 


Arg Phe Val Ser 








205 


Leu 


Trp 


His 


Glu 


Leu Cys Asp Leu 








220 




Gin 


Ser 


Leu 


Asn 


Val Asp Ala I le 






235 




240 


Thr 


Asp 


Gin 


Leu 


Gly Lys Leu Trp 




250 






255 
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Cys 


Ser 


Leu 


Ala 


Asp 


iyr 


Tup 

iyr 


T 1 Q 

lie 


Arg 


Cap 

oer 


uiy 


His 


rne 


Glu 


Lys Hia 








con 










265 










270 




Arg 


Asp 


val 


Tyr 


GlU 


GlU 


Ala 


I le 


Arg 


Thr 


Val 


Met 


1 nr 


\/o i 

vai 


Arg Asp 














280 
















Pne 


i nr 


Gin 


va i 


Phe 


Asp 


Ser 


Tyr 


Ala 


uin 


Pne 


ni 1 1 
ulU 


Glu 


er 


Met I le 














295 










300 








Ala 


Ala 


Lys 


Met 


Glu 


i nr 


Ala 


Ser 


GlU 


Leu 


Gly 


Arg 


. 

Glu 


Glu 


Glu Asp 


305 










310 










315 










Asp 


Val 


Asp 


Leu 


Glu 


Leu 


Arg 


Leu 


Ala 


Arg 


Phe 


Glu 


Gin 


Leu 


He Ser 






325 










330 












Arg 


Arg 


Pro 


Leu 


Leu 


Leu 


Asn 


Ser 


Val 


Leu 


Leu 


Arg 


Gin 


Asn 


Pro His 








340 










345 










350 




His 


Val 


His 


Glu 


Trp 


His 


Lys 


Arg 


Val 


Ala 


Leu 


His 


Gin 


Gly 


Arg Pro 






355 










360 










365 






Arg 


Glu 


He 


I le 


Asn 


Thr 


Tyr 


Thr 


Glu 


Ala 


Val 


Gin 


Thr 


Val 


Asp Pro 




370 










375 










380 








Phe 


Lys 


Ala 


Thr 


Gly 


Lys 


Pro 


His 


Thr 


Leu 


Trp 


Val 


Ala 


Phe 


Ala Lys 


385 








390 










395 








400 


Phe 


Tyr 


Glu 


Asp 


Asn 


Gly 


Gin 


Leu 


Asp 


Asp 


Ala 


Arg 


Val 


I le 


Leu Glu 








405 










410 










415 


Lys 


Ala 


Thr 


Lys 


Val 


Asn 


Phe 


Lys 


Gin 


Val 


Asp 


Asp 


Leu 


Ala 


Ser Val 








420 










425 










430 




Trp 


Cys 


Gin 


Cys 


Gly 


Glu 


Leu 


Glu 


Leu 


Arg 


His 


Glu 


Asn 


Tyr 


Asp Glu 






435 










440 










445 






Ala 


Leu 


Arg 


Leu 


Leu 


Arg 


Lys 


Ala 


Thr 


Ala 


Leu 


Pro 


Ala 


Arg 


Arg Ala 




450 










455 










460 








Glu 


Tyr 


Phe 


Asp 


Gly 


Ser 


Glu 


Pro 


Val 


Gin 


Asn 


Arg 


Val 


Tyr 


Lys Ser 


465 








470 










475 








480 


Leu 


Lys 


Val 


Trp 


Ser 


Met 


Leu 


Ala 


Asp 


Leu 


Glu 


Glu 


Ser 


Leu 


Gly Thr 






485 








490 










495 


Phe 


Gin 


Ser 


Thr 


Lys 


Ala 


Val 


Tyr 


Asp 


Arg 


I le 


Leu 


Asp 


Leu 


Arg I le 








500 










505 










510 




Ala 


Thr 


Pro 


Gin 


He 


Val 


He 


Asn 


Tyr 


Ala 


Met 


Phe 


Leu 


GlU 


Glu His 






515 










520 










525 






Lys 


Tyr 


Phe 


Glu 


Glu 


Ser 


Phe 


Lys 


Ala 


Tyr 


Glu 


Arg 


Gly 


He 


Ser Leu 




530 










535 










540 








Phe 


Lys 


Trp 


Pro 


Asn 


Val 


Ser 


Asp 


He 


Trp 


Ser 


Thr 


Tyr 


Leu 


Thr Lys 


545 










550 










555 








560 


Phe 


He 


Ala 


Arg 


Tyr 


Gly 


Gly 


Arg 


Lys 


Leu 


Glu 


Arg 


Ala 


Arg 


Asp Leu 










565 










570 










575 


Phe 


Glu 


Gin 


Ala 


Leu 


Asp 


Gly 


Cys 


Pro 


Pro 


Lys 


Tyr 


Ala 


Lys 


Thr Leu 








580 










585 










590 




Tyr 


Leu 


Leu 


Tyr 


Ala 


Gin 


Leu 


Glu 


Glu 


Glu 


Trp 


Gly 


Leu 


Ala 


Arg His 




595 








600 








605 






Ala 


Met 


Ala 


Val 


Tyr 


Glu 


Arg 


Ala 


Thr 


Arg 


Ala 


Val 


Glu 


Pro 


Ala Gin 




610 








615 










620 








Gin 


Tyr 


Asp 


Met 


Phe 


Asn 


lie 


Tyr 


He 


Lys 


Arg 


Ala 


Ala 


Glu 


He Tyr 


625 










630 










635 








640 


Gty 


Val 


Thr 


His 


Thr 


Arg 


Gly 


He 


Tyr 


Gin 


Lys 


Ala 


He 


Glu 


Val Leu 










645 










650 










655 


Ser 


Asp 


Glu 


His 


Ala 


Arg 


Glu 


Met 


Cys 


Leu 


Arg 


Phe 


Ala 


Asp 


Met Glu 






660 










665 










670 




Cys 


Lys 


Leu 


Gly 


Glu 


lie 


Asp 


Arg 


Ala 


Arg 


Ala 


He 


Tyr 


Ser 


Phe Cys 






675 










680 










685 






Ser 


Gin 


He 


Cys 


Asp 


Pro 


Arg 


Thr 


Thr 


Gly 


Ala 


Phe 


Trp 


Gin 


Thr Trp 




690 










695 










700 








Lys 


Asp 


Phe 


Glu 


Val 


Arg 


His 


Gly 


Asn 


Glu 


Asp 


Thr 


He 


Lys 


Glu Met 


705 










710 










715 








720 


Leu 


Arg 


He 


Arg 


Arg 


Ser 


Val 


Gin 


Ala 


Thr 


Tyr 


Asn 


Thr 


Gin 


Val Asn 










725 










730 










735 


Phe 


Met 


Ala 


Ser 


Gin 


Met 


Leu 


Lys 


Val 


Ser 


Gly 


Ser 


Ala 


Thr 


Gly Thr 








740 










745 










750 




Val 


Ser 


Asp 


Leu 


Ala 


Pro 


Gly 


Gin 


Ser 


Gly 


Met 


Asp 


Asp 


Met 


Lys Leu 






755 










f oU 










765 






Leu 


Glu 


Gin 


Arg 


Ala 


Glu 


Gin 


Leu 


Ala 


Ala 


Glu 


Ala 


Glu 


Arg 


Asp Gin 




770 










775 










780 








Pro 


Leu 


Arg 


Ala 


Gin 


Ser 


Lys 


He 


Leu 


Phe 


Val 


Arg 


Ser 


Asp 


Ala Ser 


785 










790 










795 








800 


Arg 


Glu 


Glu 


Leu 


Ala 


Glu 


Leu 


Ala 


Gin 


Gin 


Val 


Asn 


Pro 


Glu 


Glu He 










805 










810 










815 


Gin 


Leu 


Gly 


Glu 


Asp 


Glu 


Asp 


Glu 


Asp 


Glu 


Met 


Asp 


Leu 


Glu 


Pro Asn 








820 










825 










830 




Glu 


Val 


Arg 


Leu 


Glu 


Gin 


Gin 


Ser 


Val 


Pro 


Ala 


Ala 


Val 


Phe 


Gly Ser 
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835 

Leu Lys Glu Asp 
850 



840 



845 



(2) INFORMATION FOR SEQ ID NO:154: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 693 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO:154: 


Met 


Phe 


Ser 


Ala 


Leu 


Lys 


Lys 


Leu 


Val 


Gly 


Ser 


Asp Gin Ala Pro Gly 


1 








5 










10 




15 


Arg 




Lys 


Asn 


He 


Pro 


Ala 


Gly 


Leu 


Gin 


Ser 


Met Asn Gin Ala Leu 




20 










25 






30 


Gin 


Arg 


Arg 


Phe 


Ala 


Lys 


GLv 
u.y 


Val 


Gin 


T 

lyr 


Asn 


Mpt I vi ! 1 p Val lip 

lie 1. l~]T*> *IC Hal 1 IC 






35 










40 








45 


Arg 


my 


Asp 


. 

Arg 




Thr 


uiy 


Lys 


Thr 


Ala 


Leu 


Tpn Uiq Am 1 pi i R 1 n 




50 










55 










60 


Gly 


Arg 


Pro 


Phe 


Val 


Glu 


Glu 


_ 

i yr 


He 


Pro 


Thr 


nln Rill Tip Rln Val 


65 










70 










75 


80 


Thr 




He 


His 


irp 


Ser 


Tyr 


Lys 


Thr* 

i nr 


Thr 


Asp 


Acrs TIa Vat I \/e5 Val 
ftap 1 le? Val Lys vat 










85 










90 




95 


Glu 


Val 


Trp 


Asp 


Val 


Val 


Asp 


Lys 


Gly 


Lys 


Cys 


Lys Lys Arg Gly Asp 








100 










105 






110 


Gly 


Leu 


Lys 


Met 


ulU 


Asn 


Asp 


Pro 


bin 


. 

Glu 


xaa 


Rl ii Col* fllii Mot Ala 

ulu ocr um net Mia 






115 










120 










Leu 


Asp 


/ua 


u lU 


Phe 


Leu 


Asp 


Val 


Tyr 


Lys 


Asn 


f*\/e Ben tl\ \i V/al \/at 
LyS ASM Illy Va. Val. 




130 










135 










140 


Met 


Met 


Phe 


Asp 


lie 


Thr 


Lys 


Gin 


Trp 


Thr 


Phe 


Asn Tyr He Leu Arg 


145 








150 










155 


160 


Glu 


Leu 


Pro 


Lys 


Val 


Pro 


Thr 


His 


Val 


Pro 


Val 


Cys Val Leu Gly Asn 










165 










170 




175 


Tyr 


Arg 


Asp 


Met 


Gly 


Glu 


His 


Arg 


Val 


I le 


Leu 


Pro Asp Asp Val Arg 








180 










185 






190 


Asp 


Pne 




Asp 


Asn 


Leu 


Asp 


Arg 


Pro 


Pro 


Gly 


Ser Ser Tyr Phe Arg 






195 










200 








205 


Tyr 


Ala 


Glu 


Ser 


Sep 


Met 


Lys 


Asn 


Ser 


Phe 


Gly 


Leu Lys Tyr Leu His 




210 










215 










220 


Lys 


Phe 


Pne 


Asn 


I le 


Pro 


Phe 


Leu 


Gin 


Leu 


Gin 


Arg Glu Thr Leu Leu 


225 










230 










235 




Arg 


Gin 


Leu 


GlU 


Thr 


Asn 


Gin 


Leu 


Asp 


Met 


Asp 


Ala Thr Ipn ftlii ftlii 
HI 4 ini LCU UIU 12 1 u 










245 










_ 

250 




CjjJ 


Leu 


Ser 


Val 


Gin 


Gin 


Glu 


Tnr 


Glu 


Asp 


Gin 


Asn 


Tyr Giy lie pne Leu 








260 










265 






270 


Glu 


Met 


Met 


Glu 


Ala 


Arg 


Ser 


Arg 


Gly 


His 


Ala 


Ser Pro Leu Ala Ala 






275 










280 








285 


Asn 


Gly 


Gin 


Ser 


Pro 


Ser 


Pro 


Gly 


Ser 


Gin 


Ser 


Pro Val Leu Pro Ala 




290 










295 










300 


Pro 


Ala 


Val 


Ser 


Thr 


Gly 


Ser 


Ser 


Ser 


Pro 


Gly 


Thr Pro Gin Pro Ala 


305 










310 










315 


320 


Pro 


Gin 


Leu 


Pro 


Leu 


Asn 


Ala 


Ala 


Pro 


Pro 


Ser 


Ser Val Pro Pro Val 










325 










330 




335 


Pro 


Pro 


Ser 


GlU 


Ala 


Leu 


Pro 


Pro 


Pro 


Ala 


Cys 


Pro Ser Ala Pro Ala 








340 










345 






350 


Pro 


Arg 


Arg 


Ser 


He 


He 


Ser 


Arg 


Leu 


Phe 


Gly 


Thr Ser Pro Ala Thr 






355 










360 








365 


Glu 


Ala 


Ala 


Pro 


Pro 


Pro 


Pro 


Glu 


Pro 


Val 


Pro 


Ala Ala Gin Gly Pro 




370 










375 










380 


Ala 


Thr 


Val 


Gin 


Ser 


Val 


Glu 


Asp 


Phe 


Val 


Pro 


Asp Asp Arg Leu Asp 


385 










390 










395 


400 


Arg 


Ser 


Phe 


Leu 


Glu 


Asp 


Thr 


Thr 


Pro 


Ala 


Arg 


Asp Glu Lys Lys Val 










405 










410 




415 


Gly 


Ala 


Lys 


Ala 


Ala 


Gin 


Gin 


Asp 


Ser 


Asp 


Ser 


Asp Gly Glu Ala Leu 








420 










425 






430 


Gly 


Gly 


Asn 


Pro 


Met 


Val 


Ala 


Gly 


Phe 


Gin 


Asp 


Asp Val Asp Leu Glu 






435 










440 








445 


Asp 


Gin 


Pro 


Arg 


Gly 


Ser 


Pro 


Pro 


Leu 


Pro 


Ala 


Gly Pro Val Pro Ser 


450 










455 










460 
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Gin 


Asp 


lie 


Thr 


Leu 




Ser 


Glu 


G lu Glu 


A 1 a 

Ala 


fit it Val A 1 a 
ULU Veil Ala 


At a 

Aia 












470 








Hf J 






480 


Thr 


Lys 


Gly 


Pro 


Ala 


Pro 


Ala 


Pro 


Gin Gin 


Cys 


Ser Glu Pro 


Glu 


Thr 










HOJ 














AOG 




Lys 


Trp 


er 


Ser 


lie 


ro 


A f a 

Aia 


er 




rg 


Arg Giy Thr 


Ala 


ro 




500 










505 




510 






Tkt* 

i nr 


Arg 


inr 


Ala 


Ala 


Pro 


Pro 


Trp 


Pro Gly 


Giy 


t/a| C ar » Vat 

vai oer vai 


Arg 


1 nr 






515 










520 






525 






Gly 


Pro 


Glu 


Lys 


Arg 


Ser 


Ser 


inr 


Arg Pro 


Pro 


Ala fit II Mat 

Aia uiu net 


r t 1 r 
UlU 


Pro 




530 


















540 






Gly 


Lys 


Gly 


Glu 


Gin 


Ala 


Ser 


Ser 


Ser Glu 


Ser 


Asp Pro Glu 


Gly 


Pro 


545 










550 








ecu 






560 


He 


Ala 


Ala 


Gin 


Met 


Leu 


Ser 


Phe 


Val Met 


Asp 


Asp Pro Asp 


Phe 


Glu 










565 








570 










Ser 


Glu 


Gly 


Ser 


Asp 


Thr 


Gin 


Arg 


Arg Ala 


Asp 


Asp Phe Pro 


Val 


Arg 








580 










585 




590 






Asp 


Asp 


Pro 


Ser 


Asp 


Val 


Thr 


Asp 


Glu Asp 


Glu 


Giy Pro Ala 


Glu 


Pro 






595 










600 






605 






Pro 


Pro 


Pro 


Pro 


Lys 


Leu 


Pro 


Leu 


Pro Ala 


Phe 


Arg Leu Lys 


Asn 


Asp 




610 










615 








620 






Ser 


Asp 


Leu 


Phe 


Gly 


Leu 


Gly 


Leu 


Glu Glu 


Ala 


Gly Pro Lys 


Glu 


Ser 


625 










630 








635 






640 


Ser 


Glu 


Glu 


Gly 


Lys 


Glu 


Gly 


Lys 


Thr Pro 


Ser 


Lys Glu Lys 


Lys 


Lys 










645 








650 






655 




Lys 


Thr 


Lys 


Ser 


Phe 


Ser 


Arg 


Val 


Leu Leu 


Glu 


Arg Pro Arg 


Ala 


His 






660 










665 




670 






Arg 


Phe 


Ser 


Thr 


Arg 


Val 


Gly 


Tyr 


Gin Val 


Ser 


Val Pro Asn 


Ser 


Pro 






675 










680 






685 






Tyr 


Ser 


Glu 


Ser 


Tyr 



















690 
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